Preface


The aim of the book is to introduce the electroweak theory and the methods that 
have been developed for calculating physical processes. To this end it was decided 
to divide the book into three major parts. 


I. The road to unification 


This part gives a general view of early developments when the theory was based on 
numerous empirical rules. These topics are extensively discussed in older books on 
weak interactions, and I selected a few topics among them, such as form factors, 
CVC, and PCAC, in order to give a general impression of how the field developed. 
It should serve as an introduction to a few topics from the early period of weak 
interactions and as a guide to articles and texts. Appreciation of the first part requires 
familiarity with the methods developed at that time. The readers who find this part 
too brief or difficult may proceed to the second part, where gauge theories are 
introduced. 


II. Field theories with global or local symmetries 


This part presents field theories based on continuous symmetries and guides the 
student to the electroweak theory based on the group SU(2) × U(1). Special effort
has been made to present it in a simple and pedagogical way. For this reason the 
chapters are short and accompanied by references that the reader or lecturer can 
consult. 


III. Experimental consequences and comparisons 


The third part of the book covers some of the exciting discoveries that took place 
in the process of verifying the electroweak theory. To date, there has been no book 
dedicated to the study of the electroweak theory and its developments over the past
30 years. Several textbooks cover special chapters, but an introductory overview
for graduate students, both theorists and experimentalists, is still missing. 

The weak interactions, because they are weak, allow perturbative calculations 
that are very accurate. Since the introduction of the theory (1967), there have 
been several discoveries that have stimulated intensive research activity. These 
discoveries include 


• neutral currents
• the charm quark, bottom quark, and top quark
• neutrino properties and their interactions (oscillations)
• intermediate gauge bosons
• heavy quarks (bottom and top)
• CP violation in K- and B-meson systems


There is hardly a book in which all the topics are discussed together. One reason 
for this lack is that the discoveries happened every few years and older textbooks 
could not cover them. This book has been written over several years and includes 
the new topics. The author has worked on several topics and contributed to them. 

In several seminars it became clear that advanced students were asking many
questions on how to calculate specific processes, for example box and penguin
diagrams, processes with mixings of Majorana neutrinos, CP-violating amplitudes,
etc. The outcome of efforts to answer these questions is the collection of chapters
which form the book. The answers to several such questions comprise sections that
should help the reader to find his or her way through the ideas and the calculations
and to go further to study the original papers. Some topics, like neutral-current
calculations, may appear standard, but are again relevant and useful for the neutrino
oscillation and long-baseline experiments. The problems are an integral part of
the book and help to clarify the sections of the book or present specific cases as
examples.

The theory has still not been completely verified, because the Higgs particles 
have not been discovered. The theory may belong to a larger grand unified theory, 
which tempted me to include chapters on future developments. The book could 
have been made longer by including more chapters on QCD, grand unification, 
supersymmetry, etc. I tried to avoid this temptation and concentrated on topics that 
have become standard themes of the electroweak theory. 

I am enormously grateful to those who generously took the time to read the 
manuscript and offered corrections and technical support. The writing of a book 
relies on the support of many friends and colleagues. 

I mention in particular Dr. R. Decker (deceased) for helpful comments in Parts 
I and II, my students and collaborators Drs. A. Bareiss, M. Nowakowski, J. and 
M. Flanz, W. Rodejohann, and my postdoc Dr. O. Lalakulich. I wish to thank the 
guests at Dortmund University, Professors A. Datta, A. Kundu, and N. P. Singh,
for reading and improving special chapters. The attractive appearance of the figures
owes a lot to the skills of Drs. A. Samanpour and O. Lalakulich and Mr. A.
Kartavtsev, whom I thank. For his technical support and expert advice on hardware
and software problems I am grateful to Dr. S. Michalski. I express my thanks to
Dr. Steven Holt for editorial improvements of the text. Parts of the book were
completed when I was visiting CERN, Fermilab, and the Institute for Advanced Study
(Princeton); I wish to thank them for their hospitality.

Finally, I am greatly indebted to Mrs. Susanne Laurent for typing and retyping 
the TeX files of the manuscript with skill and patience over what turned out to be
quite a long period of time, and for continuously contributing to the process of 
preparing and improving it. Many thanks are also due to Mrs. Beate Schwertfeger, 
who typed a good part of the first drafts when I started to work on this book. 


Part I 


The road to unification 


1 


The electromagnetic current and its properties 


1.1 Introduction 


The theory of the weak interactions, better known as the electroweak theory, was
developed in two stages. In the first stage, a phenomenological interaction was
introduced and was extended when additional experimental results became available.
At that stage a large number of observations could be accounted for by empirical
rules. There still remained the desire to develop a basic theory that was finite and
renormalizable. This was achieved in the second stage by combining the
electromagnetic and weak interactions into a gauge theory: the electroweak theory.

The effective current-current interaction was introduced by Fermi in 1934,

\[
  \mathcal{H}_{\rm eff} = \frac{G_F}{\sqrt{2}}\, J_{\mu}(x)\, J^{\mu\dagger}(x), \tag{1.1}
\]

and was responsible for charged-current weak interactions of leptons and hadrons.
The current was originally introduced, in analogy to electrodynamics, for the
interaction of the electron with its neutrino and also for the neutron-proton transition,

\[
  J_{\mu}(x) = \bar{\Psi}_{\nu_e}\gamma_{\mu}(1-\gamma_5)\Psi_{e}
             + \bar{\Psi}_{p}\gamma_{\mu}(1-\gamma_5)\Psi_{n}
             + \bar{\Psi}_{\nu_\mu}\gamma_{\mu}(1-\gamma_5)\Psi_{\mu} + \cdots . \tag{1.2}
\]


Here the $\Psi$s are the fields of the fermions and the $\gamma$s are the Dirac
$\gamma$-matrices in the notation of Bjorken and Drell (1965). The shortcoming of
this theory is known as the unitarity problem and shows up in many reactions. For
example, for the reaction

\[
  \nu_{\mu} + e^{-} \to \nu_{e} + \mu^{-}
\]

we can calculate the cross section, which to lowest order is

\[
  \sigma_{\rm tot}(\nu_{\mu} e^{-} \to \nu_{e}\mu^{-}) = \frac{G_F^{2}\, s}{\pi}, \tag{1.3}
\]

with $s = 4E_{\rm cm}^{2}$, where terms proportional to the masses of the leptons have
been omitted at high energies. Because of the point coupling in (1.1) only the lowest
partial wave (angular momentum zero) can contribute to the scattering amplitude.


Then conservation of probability (unitarity) in quantum mechanics requires (see
Problems 1 and 2 at the end of Chapter 2)

\[
  \sigma_{\rm inelastic} \le \frac{\pi}{2E_{\rm cm}^{2}} \tag{1.4}
\]

for any scattering process. From (1.3) and (1.4) we find that the theory is consistent
with unitarity only for

\[
  E_{\rm cm} \le \left(\frac{\pi^{2}}{8G_F^{2}}\right)^{1/4} = 309\ {\rm GeV}. \tag{1.5}
\]
F 


Thus the theory is incomplete. 
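
The bound (1.5) can be checked numerically. The following is a minimal sketch, assuming the value of the Fermi constant quoted later in Eq. (2.4); the function names are illustrative only.

```python
import math

# Fermi constant in GeV^-2 (value quoted in Eq. (2.4) of the text)
G_F = 1.16632e-5

def sigma_fermi(e_cm):
    """Lowest-order cross section (1.3): G_F^2 s / pi with s = 4 E_cm^2 (GeV^-2)."""
    return G_F**2 * 4.0 * e_cm**2 / math.pi

def sigma_unitarity(e_cm):
    """Unitarity limit (1.4): pi / (2 E_cm^2) (GeV^-2)."""
    return math.pi / (2.0 * e_cm**2)

# Setting (1.3) equal to (1.4) gives E_cm^4 = pi^2 / (8 G_F^2), i.e. Eq. (1.5)
E_max = (math.pi**2 / (8.0 * G_F**2)) ** 0.25

print(f"E_max = {E_max:.1f} GeV")
print(f"sigma_fermi / sigma_unitarity at E_max = {sigma_fermi(E_max) / sigma_unitarity(E_max):.6f}")
```

Below `E_max` the point-interaction cross section stays under the unitarity limit; above it the lowest-order result grows past the limit, which is the inconsistency described above.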

On the other hand, why should we believe the first-order-term result for such high 
energies? It is not a matter of belief but an unfortunate fact of life that we cannot 
calculate higher-order contributions. The theory, which is based on the
Hamiltonian (1.1), is non-renormalizable and does not allow a well-defined perturbation
expansion. 

At this point we fall back upon the most successful field theory at our disposal: 
quantum electrodynamics (QED). We describe in this chapter its salient features 
and we try to develop in Part II of this book, in analogy to QED, a gauge theory of 
weak and electromagnetic interactions. In fact the second stage in the development 
of the weak interactions is to construct a well-defined and renormalizable theory. 

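We start with the Dirac Lagrangian for an electron interacting with the
electromagnetic field,

\[
  \mathcal{L} = \bar{\Psi}\left(i\gamma^{\mu}\partial_{\mu} + e\gamma^{\mu}A_{\mu} - m\right)\Psi
              - \frac{1}{4}F_{\mu\nu}F^{\mu\nu}. \tag{1.6}
\]

We think of $\Psi$ as the electron field whose current

\[
  j_{\mu} = \bar{\Psi}(x)\gamma_{\mu}\Psi(x) \tag{1.7}
\]

interacts with the electromagnetic field

\[
  \mathcal{L}_{\Psi} = \bar{\Psi}\left(i\gamma^{\mu}\partial_{\mu} - m\right)\Psi + e\, j^{\mu}A_{\mu}. \tag{1.8}
\]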


The interaction term $e\bar{\Psi}\gamma_{\mu}\Psi A^{\mu}$ fixes the vertex and the
electron propagator is the inverse of the kinetic term.

[Figure 1.1. The photon-fermion vertex, $ie\gamma_{\mu}$, and the propagator, $i/(\not{p} - m + i\epsilon)$.]


Finally, the last term in (1.6) gives the interaction between photons and involves
the electromagnetic field tensor

\[
  F_{\mu\nu} = \frac{\partial A_{\nu}}{\partial x^{\mu}} - \frac{\partial A_{\mu}}{\partial x^{\nu}}. \tag{1.9}
\]

Gauge invariance forbids a term $m^{2}A_{\mu}A^{\mu}$ that would give a mass to the photon.
QED has been one of the most precise and successful theories in all of physics and 
has been tested to a few parts per million. 

As mentioned above, the electromagnetic current describes the interaction of the 
photon with a charged fermion. The current is a local operator 


\[
  j_{\mu}(x) = \bar{\Psi}_{l}(x)\gamma_{\mu}\Psi_{l}(x), \tag{1.10}
\]

where $\Psi_{l}(x)$ is the field for the lepton $l$ and $\gamma_{\mu}$ is a Dirac matrix.
The current $j_{\mu}(x)$ is a generalization of the classical concept of a current as it
appears in Maxwell's theory. In classical electrodynamics $j_{\mu}(x)$ is a four-vector
with components

\[
  j_{\mu}(x) = \left[c\rho(x),\ \vec{j}(x)\right] = \rho(x)\left[c,\ \vec{v}\right], \tag{1.11}
\]

with $\rho(x)$ denoting the charge density, the vector $\vec{j}(x)$ the charge flow, $c$ the
speed of light, and $\vec{v}$ the velocity of the charge density. The total charge of a
particle is given by the integral

\[
  cQ = \int d^{3}x\, j_{0}(x). \tag{1.12}
\]


The current in (1.11) is an operator that transforms like a four-vector. The fields 
occurring above are also operators that create and destroy localized particle states. 
They satisfy canonical commutation relations, which quantize the theory. The
computational methods of QED can be found in many books given in the references.
We shall assume that the reader is familiar with the methods of quantum
electrodynamics.
1.2 The current for hadronic states


The electromagnetic current for a proton is more complicated since protons are not 
point-like particles, but have a measurable physical size formed by the cloud of 
pions and other hadrons which surrounds them. As a first attempt one would write 
the electromagnetic current for a proton in terms of free fields,

\[
  J_{\mu} = \bar{\Psi}_{p}(x)\gamma_{\mu}\Psi_{p}(x) = \bar{u}(p')\gamma_{\mu}u(p)\, e^{i(p'-p)\cdot x}. \tag{1.13}
\]

This form is ruled out immediately because it describes a point particle with unit
charge and a Dirac magnetic moment. It obviously fails for the case of a proton,
which has size and an anomalous magnetic moment. This implies a charge
distribution and requires additional terms on the right-hand side.

One therefore expects a more general structure, which is introduced by
considering the hadronic current as a vector operator that satisfies general symmetry
principles. We begin by considering the interaction of the electromagnetic field
$A_{\mu}(x)$ with protons. The matrix element contains the term

\[
  \langle p'|J_{\mu}(x)|p\rangle. \tag{1.14}
\]

Under translations in space and time $J_{\mu}(x)$ transforms as

\[
  J_{\mu}(x) = e^{i\hat{p}\cdot x}\, J_{\mu}(0)\, e^{-i\hat{p}\cdot x}, \tag{1.15}
\]

where $\hat{p}$ is the operator of the four-momentum; thus (1.14) reduces to

\[
  \int \langle p'|J_{\mu}(0)|p\rangle\, e^{i(p'-p)\cdot x}\, d^{4}x
    = \bar{u}(p')\,O_{\mu}\,u(p) \int d^{4}x\, e^{iq\cdot x}, \tag{1.16}
\]

with $O_{\mu}$ containing terms with $\gamma$-matrices, the antisymmetric tensor
$\epsilon_{\mu\nu\alpha\beta}$, and momenta. The spinors $u(p)$ and $u(p')$ are solutions
of the free Dirac equation. These are the requirements of Lorentz invariance.

Two other properties are

(i) gauge invariance, which translates into

\[
  q^{\mu}\langle p'|J_{\mu}(0)|p\rangle = q^{\mu}\,\bar{u}(p')\,O_{\mu}\,u(p) = 0; \tag{1.17}
\]

(ii) Hermiticity of the current

\[
  \langle p'|J_{\mu}(0)|p\rangle^{*} = \langle p|J_{\mu}(0)|p'\rangle, \qquad
  \left[\bar{u}(p')\,O_{\mu}\,u(p)\right]^{*} = \bar{u}(p)\,O_{\mu}\,u(p'), \tag{1.18}
\]

from which it follows that

\[
  O_{\mu}^{\dagger} = \gamma_{0}\,O_{\mu}\,\gamma_{0}. \tag{1.19}
\]
The above requirements limit the types of Dirac matrices and momenta which
are included in the operator $O_{\mu}$. The first subset of operators is

\[
  \{\, l_{\mu} = p_{\mu} + p'_{\mu},\quad q_{\mu} = p'_{\mu} - p_{\mu},\quad \gamma_{\mu},\quad
       i\sigma_{\mu\nu}q^{\nu},\quad \sigma_{\mu\nu}l^{\nu} \,\}, \tag{1.20}
\]

which appear in matrix elements of the vector current. In addition to the above
operators, there are also others that contain $\gamma_5$ or the antisymmetric tensor. They
are produced by higher-order weak or new interactions and their contributions to
electromagnetic matrix elements are small. For completeness we include them here
and discuss some properties in the next section. The second subset of matrices
contains $\gamma_5$,

\[
  \{\, l_{\mu}\gamma_5,\quad q_{\mu}\gamma_5,\quad \gamma_{\mu}\gamma_5,\quad \sigma_{\mu\nu}q^{\nu}\gamma_5 \,\}, \tag{1.21}
\]

and the third the antisymmetric tensor,

\[
  \{\, \epsilon_{\mu\nu\alpha\beta}\sigma^{\alpha\beta}q^{\nu},\quad
       \epsilon_{\mu\nu\alpha\beta}\sigma^{\alpha\beta}l^{\nu},\quad
       \epsilon_{\mu\nu\alpha\beta}\gamma^{\nu}\gamma_5\, q^{\alpha}l^{\beta},\quad
       \epsilon_{\mu\nu\alpha\beta}\gamma^{\nu} q^{\alpha}l^{\beta} \,\}. \tag{1.22}
\]

These terms are odd under parity transformations. Matrix elements of these
operators are not all linearly independent. For instance, matrix elements of the last three
terms in (1.22) are reduced, by judicious use of $\gamma$-matrix identities and the Dirac
equation, to matrix elements of the second set (Nowakowski et al., 2005).

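The Gordon decomposition formula

\[
  \bar{u}(p')\gamma^{\mu}u(p) = \bar{u}(p')\left[\frac{(p+p')^{\mu}}{2m} + \frac{i\sigma^{\mu\nu}q_{\nu}}{2m}\right]u(p) \tag{1.23}
\]

eliminates one term in the first subset. Similarly, the matrix element of the term
$\sigma_{\mu\nu}(p+p')^{\nu}$ can be replaced by $\bar{u}(p')q_{\mu}u(p)$. Thus the matrix
element of the vector current has the general form

\[
  \bar{u}(p')\left[\gamma_{\mu}F_{1}(q^{2}) + \frac{i\sigma_{\mu\nu}q^{\nu}}{2m}F_{2}(q^{2}) + q_{\mu}F_{3}(q^{2})\right]u(p). \tag{1.24}
\]

Gauge invariance gives an additional condition,

\[
  F_{3}(q^{2}) = 0. \tag{1.25}
\]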


The functions $F_{i}$ with $i = 1, 2, 3$ are Lorentz scalars and their argument must
remain unchanged under the replacement $p_{\mu} \to p_{\mu} + k_{\mu}$ and
$p'_{\mu} \to p'_{\mu} + k_{\mu}$ with $k_{\mu}$ an arbitrary four-vector; consequently they
are functions of $q^{2} = (p' - p)^{2}$, which justifies the argument introduced in
Eq. (1.24). We can use the Hermiticity condition as written in (1.19) to assure that
the form factors are real functions.
In summary, symmetry principles restrict the number and properties of the form 
factors. Some other consequences of symmetries are discussed in Chapter 2 and 
the problems given there. 
What is the physical meaning of form factors? As the name indicates, they
describe the structure or configuration of particles. Let us begin with an electron in
the Dirac theory. To lowest order of electrodynamics $F_{1}(0) = 1$ and $F_{2}(0) = 0$. On
replacing next the $\gamma_{\mu}$ term with the help of the Gordon decomposition, the coupling
of the electron to the electromagnetic field $A_{\mu}(x)$ is written as

\[
  e\,\bar{\Psi}_{f}(x)\gamma_{\mu}\Psi_{i}(x)A^{\mu}(x)
    = e\,\bar{\Psi}_{f}(x)\left(\frac{p_{\mu}+p'_{\mu}}{2m} + \frac{i\sigma_{\mu\nu}q^{\nu}}{2m}\right)\Psi_{i}(x)A^{\mu}(x). \tag{1.26}
\]

The non-relativistic limit produces two terms. The first term, from the sum of
momenta,

\[
  e\,\bar{u}(p')u(p)\,A^{0}(x) \tag{1.27}
\]


couples the charge density to the scalar potential because the ratio of the three- 
momentum to the mass becomes very small. The second term couples the magnetic 
moment to an external magnetic field. Considering a constant magnetic field B and 
its potential $A_{\mu}(x)$, the interaction in configuration space is

\[
  \frac{e}{2m}\,\bar{\Psi}_{f}(x)\, i\sigma_{\mu\nu}q^{\nu}\,\Psi_{i}(x)A^{\mu}(x)
    = \frac{e}{2m}\,\psi_{A}^{\dagger}\,\vec{\sigma}\cdot\vec{B}\,\psi_{A}, \tag{1.28}
\]

where $\psi_{A}$ are the upper components of the spinors (see Problem 2.5). The magnetic
field is introduced as the curl of the vector potential. Defining the magnetic
moment as

\[
  \vec{\mu} = g\,\frac{e}{2m}\,\vec{S} \quad\text{with}\quad \vec{S} = \frac{\vec{\sigma}}{2}, \tag{1.29}
\]

we obtain for the electron the gyromagnetic ratio $g = 2$. Thus a Dirac electron has
an intrinsic magnetic moment with the natural value of 2, which can be modified
by radiative corrections.

Although we have started to derive a current for extended fermions, the results of
this derivation in the form of Eqs. (1.27) and (1.29) are also valid for "point-like"
particles, when higher-order electromagnetic corrections are taken into account.
Indeed, the Lagrangian given in (1.6) will induce correction terms compatible with
the symmetries of the Lagrangian. We see from (1.23) and (1.24) that $F_{2}$ will also
contribute to the magnetic moment via $\mu = \frac{e}{2m}[F_{1}(0) + F_{2}(0)]$. Both for the
electron and for the muon, the magnetic moments have been measured very accurately.
They have also been calculated theoretically and the agreement is very good. For the
electron

\[
  \tfrac{1}{2}(g - 2)_{e} = 0.001\,159\,652\,209\,(31), \tag{1.30}
\]


with the number in parentheses denoting the experimental accuracy. Very accurate
results exist also for the muons. The deviation from the value of 2 comes from
radiative corrections, which in quantum electrodynamics have been calculated
precisely (Kinoshita, 1990).

The situation is very different for protons and neutrons. The experimental values 
are 


\[
  F_{1}(0) = 1 \quad\text{and}\quad F_{2}(0) = 1.79 \quad\text{for the proton,} \tag{1.31}
\]
\[
  F_{1}(0) = 0 \quad\text{and}\quad F_{2}(0) = -1.91 \quad\text{for the neutron.} \tag{1.32}
\]


The changes come from the strong interactions and cannot yet be calculated. 
They are called the anomalous magnetic moments and have been measured in 
electron-hadron scattering experiments. In addition to their values at $q^{2} = 0$, the
form factors have been measured over extended regions of the momentum-transfer
squared and were found to decrease rapidly with $q^{2}$. This behavior indicates the
existence of a charge distribution of virtual particles around the proton and the 
neutron, with the charge density decreasing rapidly with increasing radius. The 
motion of the particles creates magnetic fields, which are manifested in the values 
of the magnetic moments. 
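
As a small numerical illustration of the relation $\mu = \frac{e}{2m}[F_{1}(0) + F_{2}(0)]$, the sketch below evaluates the total magnetic moments in units of $e/(2m)$ from the values in (1.31) and (1.32). The function name is hypothetical, and for the nucleons $m$ is assumed to be the nucleon mass, so the unit is the nuclear magneton.

```python
def magnetic_moment(f1_at_zero, f2_at_zero):
    """Total magnetic moment in units of e/(2m): F1(0) + F2(0)."""
    return f1_at_zero + f2_at_zero

# Dirac (point-like) particle at lowest order: F1(0) = 1, F2(0) = 0
mu_dirac = magnetic_moment(1.0, 0.0)

# Experimental form-factor values from Eqs. (1.31) and (1.32)
mu_proton = magnetic_moment(1.0, 1.79)
mu_neutron = magnetic_moment(0.0, -1.91)

print(f"Dirac particle: {mu_dirac:+.2f} e/(2m)")
print(f"proton:         {mu_proton:+.2f} e/(2m)")
print(f"neutron:        {mu_neutron:+.2f} e/(2m)")
```

The large departures of the proton and neutron from the Dirac values 1 and 0 are the anomalous magnetic moments discussed above.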


1.3 Parity-violating form factors 


For completeness we include additional couplings of the photon induced by weak 
interactions inside the vertex. Omitting this section will not affect the study of the 
following chapters. 

The electromagnetic force is not the only force between particles. For instance, 
the presence of weak terms changes the general structure of the electromagnetic 
matrix elements. The interaction of a photon with a particle does not mean that 
the whole process is electromagnetic, since higher-order corrections must also 
include the weak interactions. Conceptually it is easy to include these effects in the 
electromagnetic current, by dropping the restrictions that the current is invariant 
under the discrete symmetries charge conjugation C, parity P, and time-reversal 
T. Imposing Lorentz invariance, gauge invariance, and Hermiticity means that one 
must include two additional form factors (F3 and F4) and the electromagnetic 
current takes a more general form, 


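\[
  \bar{u}(p')\,O_{\mu}\,u(p) = \bar{u}(p')\left[\gamma_{\mu}F_{1}(q^{2})
      + \frac{i\sigma_{\mu\nu}q^{\nu}}{2m}F_{2}(q^{2})
      + \frac{i\epsilon_{\mu\nu\alpha\beta}\sigma^{\alpha\beta}q^{\nu}}{4m}F_{3}(q^{2})
      + \frac{1}{2m}\left(q_{\mu} - \frac{q^{2}}{2m}\gamma_{\mu}\right)\gamma_5 F_{4}(q^{2})\right]u(p). \tag{1.33}
\]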


We know from classical electrodynamics and quantum mechanics that the fields 
transform under parity P and time-reversal T as shown in Table 1.1. 
Table 1.1. Transformation of the fields under parity P and time-reversal T

    Quantity        under P        under T
    B               B              -B
    E               -E             E
    σ               σ              -σ

From Table 1.1 we can infer immediately that $\vec{\sigma}\cdot\vec{B}$, an interaction defining
the second form factor $F_{2}(q^{2})$, conserves parity and time-reversal. Similarly, the
non-relativistic reduction of all form factors including $F_{3}(q^{2})$ and $F_{4}(q^{2})$ is given
by

\[
  H_{\rm int} \simeq -\mu\,\vec{\sigma}\cdot\vec{B} - d\,\vec{\sigma}\cdot\vec{E}
      - a\,\vec{\sigma}\cdot\left[\nabla\times\vec{B} - \frac{\partial\vec{E}}{\partial t}\right], \tag{1.34}
\]

with $F_{1}(0) = e$ (charge), $[F_{1}(0) + F_{2}(0)]/(2m) = \mu$ (magnetic dipole moment),
$F_{3}(0)/(2m) = d$ (electric dipole moment), and $F_{4}(0) \propto a$ is called the anapole
moment (Zeldovich, 1958). It is evident that the presence of $F_{3}$ leads to a parity-
and time-reversal-violating interaction. Physical phenomena that exhibit violation
of time-reversal are very scarce. Therefore, the observation of $d \neq 0$ will be a
physical breakthrough. Up to now only upper limits for $d$ have been established for
electrons and nucleons.

The fourth form factor $F_{4}(q^{2})$ is even under time-reversal but violates parity. It is
frequently omitted from discussions of the electromagnetic form factors, because
it is an off-shell form factor, in the sense that its interaction with an on-shell photon
vanishes. This is easily seen because $q^{2} = 0$ and $\epsilon_{\mu}q^{\mu} = 0$ for on-shell photons.
In addition, this form factor can appear only in matter with currents producing the
electromagnetic fields, because for classical fields the expression
$\nabla\times\vec{B} - \partial\vec{E}/\partial t$,
which appears in the anapole interaction, vanishes (Maxwell equation in vacuum) in
the absence of a current. Finally, for neutral fermions, which do not carry any global 
quantum numbers, like Majorana neutrinos, only the anapole form factor is possible. 
For a more detailed treatment of the form factors I recommend Nowakowski et al. 
(2005). 
The weak currents


2.1 The weak currents and some of their properties 


The effective weak interaction in Eq. (1.1) was motivated by nuclear $\beta$-decays. For
many years this was the main theoretical framework for analyzing experiments.
As new experimental discoveries became available, the form of the interaction was
maintained but the current $J_{\mu}(x)$ was enlarged to incorporate the new observations.
At the end of the sixties the charged current $J^{\dagger}_{\mu}(x)$ included a leptonic and a
hadronic term,

\[
  J^{\dagger}_{\mu}(x) = l^{\dagger}_{\mu}(x) + h^{\dagger}_{\mu}(x). \tag{2.1}
\]

The leptonic part of the current is

\[
  l^{\dagger}_{\mu}(x) = \bar{\Psi}_{e}(x)\gamma_{\mu}(1-\gamma_5)\Psi_{\nu_e}(x)
                       + \bar{\Psi}_{\mu}(x)\gamma_{\mu}(1-\gamma_5)\Psi_{\nu_\mu}(x), \tag{2.2}
\]


with the first term corresponding to the electron and its neutrino and the second term 
to the muon and its neutrino. Its space-time structure has a vector part analogous 
to the electromagnetic current and an axial part introduced after the discovery of 
parity violation. A direct calculation using the currents in (2.2) gives the $\mu$-decay
spectrum, which is in good agreement with experiment. It also gives the decay rate
of the muon as

\[
  \Gamma(\mu \to e + \bar{\nu}_{e} + \nu_{\mu}) = \frac{G_{\mu}^{2}\, m_{\mu}^{5}}{192\pi^{3}}. \tag{2.3}
\]

From the observed decay rate and the mass of the muon the constant $G_{\mu}$ is
determined to be

\[
  G_{\mu} = (1.166\,32 \pm 0.000\,04) \times 10^{-5}\ {\rm GeV}^{-2}. \tag{2.4}
\]


This determination includes the effects of radiative corrections, which in the 
electroweak theory are finite and can be calculated precisely. 
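
The relation between (2.3) and (2.4) can be illustrated numerically. The sketch below evaluates the lowest-order rate (2.3) and converts it to a lifetime; the muon mass and the value of $\hbar$ are assumptions taken from standard tables, not from the text, and radiative corrections are ignored.

```python
import math

G_MU = 1.16632e-5        # GeV^-2, Eq. (2.4)
M_MU = 0.1056584         # GeV, muon mass (assumed value, not quoted in the text)
HBAR = 6.582119569e-25   # GeV s, converts a width in GeV to a lifetime in seconds

def muon_decay_rate(g_mu, m_mu):
    """Lowest-order muon decay rate of Eq. (2.3), in GeV."""
    return g_mu**2 * m_mu**5 / (192.0 * math.pi**3)

gamma_mu = muon_decay_rate(G_MU, M_MU)
tau_mu = HBAR / gamma_mu  # lifetime in seconds

print(f"Gamma = {gamma_mu:.3e} GeV")
print(f"tau   = {tau_mu:.3e} s")
```

The result is close to the measured lifetime of about 2.2 microseconds; in practice the logic runs the other way, with the measured lifetime and the corrections fixing $G_{\mu}$.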
The hadronic current consists of several parts determined by detailed analyses
of hadron decays. For instance, the decay of a neutron, $n \to p + e^{-} + \bar{\nu}_{e}$, is well
described by the matrix element

\[
  \langle p|J^{\dagger}_{\mu}|n\rangle = \langle p|V^{\dagger}_{\mu}|n\rangle - \langle p|A^{\dagger}_{\mu}|n\rangle, \tag{2.5}
\]

where we can decompose the matrix elements in terms of form factors. Lorentz
invariance gives the general expressions

\[
  \langle p|V^{\dagger}_{\mu}|n\rangle = \bar{u}(p')\left(g_{V}\gamma_{\mu} + f_{V}\frac{i\sigma_{\mu\nu}q^{\nu}}{2M} + h_{V}\frac{q_{\mu}}{M}\right)u(p) \tag{2.6}
\]

and

\[
  \langle p|A^{\dagger}_{\mu}|n\rangle = \bar{u}(p')\left(g_{A}\gamma_{\mu}\gamma_5 + f_{A}\frac{i\sigma_{\mu\nu}q^{\nu}}{2M}\gamma_5 + h_{A}\frac{q_{\mu}}{M}\gamma_5\right)u(p), \tag{2.7}
\]

where $p_{\mu}$ and $p'_{\mu}$ are the momenta of the neutron and proton, respectively, with
$q_{\mu} = p'_{\mu} - p_{\mu}$. The functions $g_{V}$, $f_{V}$, and $h_{V}$ are vector form factors describing
the effects of strong interactions in the hadrons. Similarly $g_{A}$, $f_{A}$, and $h_{A}$ are axial
form factors. General symmetries, like charge symmetry and time-reversal, limit
the form factors and demand that $h_{V} = f_{A} = 0$ (see Marshak et al., 1969, p. 314).
At zero momentum transfer, the vector form factor $g_{V}$ was precisely determined and
it is strikingly close to 1, while $g_{A}$ is about $-1.23$. An explanation was proposed,
namely that the strangeness-conserving part of $V^{\dagger}_{\mu}$ has the isospin content

\[
  V^{\dagger}_{\mu} = V^{1}_{\mu} + iV^{2}_{\mu} =: V^{+}_{\mu} \quad\text{and}\quad
  A^{\dagger}_{\mu} = A^{1}_{\mu} + iA^{2}_{\mu} =: A^{+}_{\mu}, \tag{2.8}
\]

where 1 and 2 denote the first and second components of isospin. This means that 
the charges 


\[
  T^{i} = \int V^{i}_{0}(x)\, d^{3}x \tag{2.9}
\]

are the same isospin generators as those occurring in the strong interactions and
are therefore conserved. This rule is called the conserved-vector-current (CVC)
hypothesis. The $T^{i}$ form an algebra that closes under the commutation relations

\[
  [T^{i}, T^{j}] = i\epsilon^{ijk}\,T^{k}. \tag{2.10}
\]

As a consequence, the commutator of $T^{+}$ with $T^{-}$ produces the third component of
isospin. In the late sixties $T^{3}$ had not yet been observed to mediate transitions with
the strength $G$; there was no weak neutral current. But such an operator already
existed in the electromagnetic current. The electromagnetic current consisted of
two parts,

\[
  J^{\rm em}_{\mu}(x) = V^{3}_{\mu}(x) + \frac{1}{\sqrt{3}}\,V^{8}_{\mu}(x), \tag{2.11}
\]

with $V^{3}_{\mu}(x)$ being the third component of isospin and $V^{8}_{\mu}$ an iso-scalar current
transforming as the eighth component of SU(3). It is evident that there is a
relation between the weak and the electromagnetic currents, since the vector part of the
weak current and the isovector part of the electromagnetic current form an isotriplet.
The form of the interaction in (1.1) defines a universal coupling for leptonic,
semileptonic, and non-leptonic decays. Once the coupling constant $G$ has been
determined, as in (2.4) from the muon decay, it can be used to translate the isotriplet
hypothesis into relations between electromagnetic and weak matrix elements. In
Section 11.3 we give a consequence of the isotriplet hypothesis and the cross
section for neutrino-neutron quasi-elastic scattering. Expressions for the currents
in terms of quark fields are given in Chapter 3.
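
The statement that the commutator of $T^{+}$ with $T^{-}$ produces $T^{3}$ can be verified in the simplest representation. The following is a minimal sketch, assuming the isospin-1/2 representation $T^{i} = \tau^{i}/2$ (Pauli matrices) acting on the proton-neutron doublet; the helper functions are written out by hand so that no external library is needed.

```python
def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def combine(ca, a, cb, b):
    """Linear combination ca*a + cb*b of two 2x2 matrices."""
    return [[ca * a[i][j] + cb * b[i][j] for j in range(2)] for i in range(2)]

def commutator(a, b):
    return combine(1, matmul(a, b), -1, matmul(b, a))

def is_close(a, b, tol=1e-12):
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(2) for j in range(2))

def epsilon(i, j, k):
    """Totally antisymmetric symbol for indices 0, 1, 2."""
    return (i - j) * (j - k) * (k - i) / 2

# Isospin-1/2 generators T^i = tau^i / 2 (assumed representation)
T = [
    [[0, 0.5], [0.5, 0]],
    [[0, -0.5j], [0.5j, 0]],
    [[0.5, 0], [0, -0.5]],
]
ZERO = [[0, 0], [0, 0]]

def structure_rhs(i, j):
    """Right-hand side of Eq. (2.10): i * sum_k epsilon^{ijk} T^k."""
    total = ZERO
    for k in range(3):
        total = combine(1, total, 1j * epsilon(i, j, k), T[k])
    return total

algebra_closes = all(
    is_close(commutator(T[i], T[j]), structure_rhs(i, j))
    for i in range(3) for j in range(3)
)

# Raising and lowering combinations T^+- = T^1 +- i T^2
T_plus = combine(1, T[0], 1j, T[1])
T_minus = combine(1, T[0], -1j, T[1])
plus_minus_gives_T3 = is_close(commutator(T_plus, T_minus), combine(2, T[2], 0, ZERO))

# T^+ turns the neutron (I_3 = -1/2) into the proton (I_3 = +1/2) with unit strength
neutron = [0, 1]
proton = [1, 0]
raised = [sum(T_plus[i][k] * neutron[k] for k in range(2)) for i in range(2)]

print(algebra_closes, plus_minus_gives_T3, raised)
```

In this representation $[T^{+}, T^{-}] = 2T^{3}$, so closing the algebra of the charged currents requires a neutral generator, which is the point made in the text.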

Since $V^{i}_{\mu}$ is an isospin current, its matrix elements at zero momentum transfer
are simply given by Clebsch-Gordan coefficients. The strength $g_{V}$ is determined
in nuclear $\beta$-decay as well as in the elementary decays

\[
  \pi^{+} \to \pi^{0} + e^{+} + \nu_{e}, \qquad n \to p + e^{-} + \bar{\nu}_{e}. \tag{2.12}
\]

In all these cases the charged current connects states with the same isospin $I$, but
different components $I_{3}$. At zero momentum transfer the relevant matrix element
is

\[
  \langle I, I_{3}+1|V^{+}_{0}(0)|I, I_{3}\rangle = 1, \tag{2.13}
\]

for $I = \tfrac{1}{2}$. The value of $g_{V}$ is extracted from $\beta$-decay and its value is found (see
Equation (9.28)) to be

\[
  g_{V} = 0.9740 \pm 0.0003 \pm 0.0015. \tag{2.14}
\]


This precise value includes radiative corrections, so its deviation from unity is 
significant. 

Is the small difference of 2.6% a drawback of the theory or is there another
component of the current? The discrepancy was explained by the observation that the
hadronic current $V_{\mu}$ does not generate only the isospin group SU(2), but contains
other pieces responsible for strangeness-changing decays, like

\[
  \Lambda^{0} \to p + e^{-} + \bar{\nu}_{e}, \qquad K^{+} \to \pi^{0} + e^{+} + \nu_{e}. \tag{2.15}
\]

Thus the hadronic current is the sum of a $\Delta S = 0$ term and a $\Delta S = 1$ term,

\[
  V^{\dagger}_{\mu} = \cos\theta_{c}\, V^{\Delta S=0}_{\mu} + \sin\theta_{c}\, V^{\Delta S=1}_{\mu}. \tag{2.16}
\]

The two terms are interpreted as two components of the current orthogonal to each
other and connected through the mixing angle $\theta_{c}$. The first term contains the isospin
current that appears in (2.8),

\[
  V^{\Delta S=0}_{\mu} = V^{1}_{\mu} + iV^{2}_{\mu}. \tag{2.17}
\]

The second component produces strangeness-changing transitions and it has, in
SU(3), the form

\[
  V^{\Delta S=1}_{\mu} = V^{4}_{\mu} + iV^{5}_{\mu}. \tag{2.18}
\]

The matrix elements of $V^{\Delta S=0}_{\mu}$ and $V^{\Delta S=1}_{\mu}$ can be estimated accurately.
Numerous experimental estimates lead to the mixing angle

\[
  \sin\theta_{c} = 0.220 \pm 0.002. \tag{2.19}
\]

In this way universality is restored, since the sum of the squares of the hadronic
couplings reproduces the coupling observed in muon decay. In addition the
discrepancy of $g_{V}$ from 1 is understood. The angle $\theta_{c}$ is called the Cabibbo angle.
The appearance of the Cabibbo angle will become more evident in the context of
the Cabibbo-Kobayashi-Maskawa matrix, which enters the full Lagrangian of the
weak interaction.
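
A quick arithmetic check of this universality argument, using only the central values quoted in (2.14) and (2.19), can be sketched as follows; the variable names are illustrative and the quoted uncertainties are not propagated.

```python
import math

G_V = 0.9740            # Eq. (2.14), central value
SIN_THETA_C = 0.220     # Eq. (2.19), central value

# If the Delta S = 0 current enters with cos(theta_c), g_V should equal cos(theta_c)
cos_theta_c = math.sqrt(1.0 - SIN_THETA_C**2)

# Universality: squares of the Delta S = 0 and Delta S = 1 strengths should add up to 1
universality_sum = G_V**2 + SIN_THETA_C**2

print(f"cos(theta_c)       = {cos_theta_c:.4f}")
print(f"g_V                = {G_V:.4f}")
print(f"g_V^2 + sin^2      = {universality_sum:.4f}")
```

The computed cosine agrees with the measured $g_{V}$ to within about the quoted uncertainty of (2.14), which is the sense in which the 2.6% deficit is accounted for.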

Finally, we mention one more difference between electromagnetic and weak
interactions. The amplitude for the electromagnetic reaction $e^{+}e^{-} \to \mu^{+}\mu^{-}$ is

\[
  \mathcal{M} = ie^{2}\, j_{\mu}\,\frac{1}{q^{2}}\, j^{\mu}, \tag{2.20}
\]

with an explicit photon propagator and the product of two currents, like

\[
  J^{\rm em}_{\mu} = j^{l}_{\mu} + J^{h}_{\mu}. \tag{2.21}
\]

The hadronic current was discussed in Chapter 1 and the leptonic current has a term
for each charged lepton like

\[
  j^{l}_{\mu} = \bar{\Psi}_{l}\gamma_{\mu}\Psi_{l}. \tag{2.22}
\]

On comparing (2.20) with (1.1), we note that the propagator is missing in (1.1). It
should have been there in the form

\[
  g^{2}\Delta_{\mu\nu}(q^{2}) = g^{2}\,\frac{-i g_{\mu\nu}}{q^{2} - M_{W}^{2}} \tag{2.23}
\]

if the weak interaction were mediated by the exchange of a particle of mass $M_{W}$ and
coupling strength $g$. At very low energies, however, at which most of the decays
take place, $q^{2} \ll M_{W}^{2}$, and

\[
  g^{2}\Delta_{\mu\nu} \to i\,\frac{g^{2}}{M_{W}^{2}}\, g_{\mu\nu} = i g_{\mu\nu}\frac{G}{\sqrt{2}}. \tag{2.24}
\]

Thus the form in (1.1) is indeed a very good approximation.
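
To see how good the approximation is, one can compare the propagator (2.23) with its low-energy limit (2.24). The sketch below is illustrative only: the boson mass $M_{W} = 80.4$ GeV and the muon mass are assumed values that are not quoted in this section, and the coupling $g$ is defined through the normalization of (2.24).

```python
import math

G = 1.16632e-5     # GeV^-2, Eq. (2.4)
M_W = 80.4         # GeV, assumed mass of the exchanged boson (not quoted in this section)
M_MU = 0.1056584   # GeV, assumed muon mass, sets the scale of q^2 in muon decay

def propagator_ratio(q2, m_w=M_W):
    """Ratio of the full propagator (2.23) to its contact limit (2.24): 1 / (1 - q^2/M_W^2)."""
    return 1.0 / (1.0 - q2 / m_w**2)

# Coupling implied by the normalization of Eq. (2.24): g^2 / M_W^2 = G / sqrt(2)
g_squared = G * M_W**2 / math.sqrt(2.0)

ratio_muon_decay = propagator_ratio(M_MU**2)   # q^2 of order m_mu^2
ratio_10_gev = propagator_ratio(-(10.0**2))    # space-like q^2 = -(10 GeV)^2

print(f"g^2 (normalization of Eq. 2.24) = {g_squared:.4f}")
print(f"correction in muon decay        = {ratio_muon_decay - 1.0:.2e}")
print(f"ratio at |q| = 10 GeV           = {ratio_10_gev:.4f}")
```

For decay-scale momentum transfers the correction is of order $q^{2}/M_{W}^{2}$, a few parts per million, while at momentum transfers of tens of GeV it reaches the percent level.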


2.2 The partially conserved axial current 


A second property of the weak currents deals with approximations that are possible
in matrix elements of the axial current. The charged pions decay weakly into $\mu\nu_{\mu}$
pairs with a hadronic matrix element

\[
  \langle 0|A_{\mu}(x)|\pi(q)\rangle = i f_{\pi}(q^{2})\, q_{\mu}\, e^{-iq\cdot x}. \tag{2.25}
\]

Here $q_{\mu}$ is the four-momentum of the pion and $f_{\pi}$ defines the decay coupling
constant. The form of the matrix element is dictated by Lorentz invariance. The coupling
$f_{\pi}(q^{2} = m_{\pi}^{2})$ is measured with the pion on the mass shell. We can take the
divergence of this matrix element and obtain

\[
  q^{\mu}\langle 0|A_{\mu}(0)|\pi(q)\rangle = i f_{\pi} q^{2} = i f_{\pi} m_{\pi}^{2}. \tag{2.26}
\]

We conclude from this relation that the axial current is not conserved, because
neither $f_{\pi}$ nor $m_{\pi}$ is zero. However, it may be approximately conserved because
$m_{\pi}^{2}$ is a small number relative to the mass squared of all other hadrons. Thus,
for many low-energy processes that involve the axial current with four-momentum
$q_{\mu}$, it is possible to replace the divergence of the axial current by the pion field

\[
  \partial^{\mu}A^{i}_{\mu} = f_{\pi} m_{\pi}^{2}\,\phi^{i}, \tag{2.27}
\]

and, in addition, after we have extracted the pion propagator, the reduced matrix
element is a slowly varying function of $q^{2}$ provided that $q^{2} < m_{\pi}^{2}$. Equation (2.27)
is an operator relation and holds for all matrix elements. We must be careful,
however, to replace the pion field by its source $j^{i}_{\pi} = (\Box + m_{\pi}^{2})\phi^{i}$ and substitute
for the pion-nucleon vertex the coupling

\[
  \langle p|j^{+}_{\pi}|n\rangle = i\sqrt{2}\, g_{\pi NN}\,\bar{u}(p')\gamma_5 u(p). \tag{2.28}
\]


[Figure 2.1. Direct coupling of the axial current to a particle of zero mass. The diagram carries the labels $q_{\mu}$, $i f_{\pi}$ and $i\sqrt{2}\, g_{\pi NN}\gamma_5$.]

Several applications have established that the matrix elements of the axial current
and its divergence can be treated this way. We shall describe here an application
of this procedure to the matrix element in $\beta$-decay, which leads to a remarkable
relation known as the Goldberger-Treiman relation. We present the derivation in
two ways in order to emphasize that the second method is based on an underlying
symmetry. Consider the matrix element

\[
  \langle p|A^{+}_{\mu}|n\rangle = \bar{u}(p')\left(g_{A}(q^{2})\gamma_{\mu}\gamma_5 + \frac{q_{\mu}}{M}h_{A}(q^{2})\gamma_5\right)u(p). \tag{2.29}
\]

Taking the divergence of both sides of this equation gives

\[
  \langle p|\partial^{\mu}A^{+}_{\mu}|n\rangle = \left(2M g_{A}(q^{2}) + q^{2}\frac{h_{A}(q^{2})}{M}\right) i\,\bar{u}(p')\gamma_5 u(p). \tag{2.30}
\]

On the other hand, from (2.27)

\[
  \langle p|\partial^{\mu}A^{+}_{\mu}|n\rangle = f_{\pi}m_{\pi}^{2}\langle p|\phi^{+}|n\rangle
    = f_{\pi}m_{\pi}^{2}\,\frac{1}{-q^{2}+m_{\pi}^{2}}\,\langle p|j^{+}_{\pi}|n\rangle
    = f_{\pi}\,\frac{m_{\pi}^{2}}{-q^{2}+m_{\pi}^{2}}\; i\sqrt{2}\, g_{\pi NN}\,\bar{u}(p')\gamma_5 u(p). \tag{2.31}
\]

Taking the limit $q^{2} \to 0$ with $m_{\pi}^{2} \neq 0$ in the last two equations, we obtain

\[
  \sqrt{2}\, M g_{A} = g_{\pi NN}\, f_{\pi}. \tag{2.32}
\]


This is the Goldberger-Treiman relation. For the experimental values of the
coupling constants it holds at the 10% level. It is a remarkable relation, relating
the pion-nucleon coupling constant to two couplings of weak interactions.
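
The quoted 10% accuracy can be illustrated with representative numbers. In the sketch below only the magnitude of $g_{A}$ comes from the text; the nucleon mass, the pion-nucleon coupling and the pion decay constant are assumed illustrative values, with $f_{\pi}$ taken in the normalization that matches the factor $\sqrt{2}$ in (2.32).

```python
import math

M_NUCLEON = 0.939   # GeV, nucleon mass (assumed value)
G_A = 1.23          # magnitude of the axial coupling quoted in the text
G_PI_NN = 13.4      # pion-nucleon coupling (assumed illustrative value)
F_PI = 0.131        # GeV, pion decay constant (assumed value and normalization)

lhs = math.sqrt(2.0) * M_NUCLEON * G_A   # left-hand side of Eq. (2.32)
rhs = G_PI_NN * F_PI                     # right-hand side of Eq. (2.32)
relative_deviation = abs(lhs / rhs - 1.0)

print(f"sqrt(2) M g_A = {lhs:.3f} GeV")
print(f"g_piNN f_pi   = {rhs:.3f} GeV")
print(f"deviation     = {100.0 * relative_deviation:.1f} %")
```

With these inputs the two sides differ by several percent, consistent with the statement that the relation holds at the 10% level; other choices of the assumed couplings shift the deviation somewhat.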

There is a second way of looking at the partially conserved axial current (PCAC).
The meaning of PCAC is that the actual world is not far from the limit in
which the axial currents are conserved at the expense of having zero-mass
pions ($m_{\pi} = 0$, $f_{\pi} \neq 0$). In this approach we can still define $f_{\pi}$ and $g_{A}$ through
Eqs. (2.25) and (2.29). Because the axial current is now conserved, Eq. (2.30)
becomes

\[
  2M g_{A}(q^{2}) + q^{2}\,\frac{h_{A}(q^{2})}{M} = 0. \tag{2.33}
\]

In the limit of $q^{2} \to 0$ the second term of Eq. (2.29) does not vanish but
contributes the amplitude

\[
  i f_{\pi} q_{\mu}\,\frac{i}{q^{2}}\; i\sqrt{2}\, g_{\pi NN}\,\bar{u}(p')\gamma_5 u(p) \tag{2.34}
\]

shown by the diagram in Fig. 2.1. The amplitude has a pole at $q^{2} = 0$ and its
divergence gives

\[
  q^{2}\,\frac{h_{A}(q^{2})}{M} \to -\sqrt{2}\, f_{\pi}\, g_{\pi NN} + \text{terms proportional to } q^{2}, \tag{2.35}
\]

which, together with (2.33), gives again the Goldberger-Treiman relation. This
demonstrates that the form factor $h_{A}(q^{2})$ is dominated at small momentum transfers
by the pion pole.


2.3 Regularities among the forces 


The subjects covered in the first two chapters represent basic topics developed long 
before the electroweak theory. They strongly suggest that the weak force is not an 
isolated phenomenon, but one intimately connected with the other forces of nature. 
The isotriplet hypothesis clearly states that the isovector part of the electromagnetic 
current and the vector part of the weak current for $\Delta S = 0$ transitions form an
isospin triplet. In addition, it states that the charges $T^\pm$ are the same generators as
those of strong isospin. We note that operators of the three types of interactions are 
related. The isotriplet hypothesis also posed a problem: that of explaining why the 
neutral member of the multiplet did not occur in the weak interactions by itself, but 
only through electromagnetism. This question was answered with the discovery of 
weak neutral currents. 

The hypothesis of PCAC relates couplings of the weak interactions to the
pion-nucleon coupling constant through the Goldberger-Treiman relation. In another
application, PCAC combined with equal-time commutation relations makes it possible
to calculate the deviation of $g_A$ from 1 as an integral over the pion-nucleon cross
sections.

Consequences of PCAC hold at the 10%-20% level. They are understood to
hold because the mass of the pion is small in comparison with the masses of other 
hadrons. That is, there is an underlying symmetry, which is broken by the small mass 
of the pion. The previous remarks provide a strong motivation to search for a closer 
connection of the weak, electromagnetic, and perhaps the strong interactions. The 
successful theory which unifies the weak and electromagnetic forces is studied in 
the following chapters. The electroweak theory is so far in excellent agreement with 
experiment. It made many predictions that have been confirmed by experimental 
data. Finally, the reader should keep in mind that the theory must also provide a 
natural explanation of the empirical rules described so far and others to be described 
in the following chapters. 
Problems for Chapters 1 and 2


1. The scattering of particles

$$a + b \to c + d$$

is described by the amplitude

$$f(\theta) = \frac{1}{k}\sum_l (2l+1)\,\frac{\eta_l e^{2i\delta_l} - 1}{2i}\,P_l(\cos\theta),$$

where $\eta_l$ and $\delta_l$ are real functions and $k$ is the magnitude of the momentum of particle
$a$ or $b$ in the center-of-mass system. $\delta_l$ is the phase shift and $\eta_l$ is introduced to describe
inelastic scattering: for elastic scattering $\eta_l = 1$ and for inelastic scattering $\eta_l < 1$.
(a) Prove the optical theorem and show that

$$\sigma_{tot} = \frac{2\pi}{k^2}\sum_l (2l+1)\,[1 - \eta_l\cos(2\delta_l)].$$

(b) Show that, for elastic scattering,

$$\sigma_{el} = \frac{4\pi}{k^2}\sum_l (2l+1)\left|\frac{\eta_l e^{2i\delta_l} - 1}{2i}\right|^2.$$

(c) Show that, from (a) and (b), it follows that

$$\sigma_{inel} = \frac{\pi}{k^2}\sum_l (2l+1)\,(1 - \eta_l^2).$$

2. Using the result from Problem 1 (part (c)), show that the cross section for the reaction
$\nu_\mu + e^- \to \mu^- + \nu_e$ is limited by

$$\sigma(\nu_\mu + e^- \to \mu^- + \nu_e) \le \frac{\pi}{2E_{cm}^2},$$

where $E_{cm}$ is the energy of the $\nu_\mu$ or the $e^-$ in the center-of-mass frame. Take into
account that it is an $l = 0$ scattering and that there is a spin factor of $(2s + 1)$.

3. From the Hermiticity of the electromagnetic current, show that $F_1(q^2)$ and $F_2(q^2)$ are
real.

4. From time-reversal invariance, show that F; q and F3(q*) are real. 

5. By considering the non-relativistic limit of the Pauli interaction,

$$\frac{1}{2}\,\bar\psi\,\sigma_{\mu\nu}\,\psi\,F^{\mu\nu}, \qquad F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu,$$

give a physical interpretation of the term containing $F_2(q^2)$. Express $F_2(0)$ in terms of
the proton's anomalous magnetic moment.
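The three partial-wave cross sections of Problem 1 can be checked against each other numerically before attempting the proof. The sketch below is a sanity check, not a derivation; it evaluates the contribution of a single partial wave and confirms that total = elastic + inelastic, and that the inelastic part is largest for $\eta_l = 0$. The function name and the sample values of $\eta_l$, $\delta_l$ and $k$ are illustrative choices.

```python
import cmath
import math


def partial_wave_cross_sections(l, eta, delta, k):
    """Contribution of partial wave l to (total, elastic, inelastic)."""
    weight = (2 * l + 1) * math.pi / k**2
    amplitude = (eta * cmath.exp(2j * delta) - 1) / 2j
    total = 2 * weight * (1 - eta * math.cos(2 * delta))
    elastic = 4 * weight * abs(amplitude) ** 2
    inelastic = weight * (1 - eta**2)
    return total, elastic, inelastic


# Illustrative sample points: (l, eta, delta, k).
for l, eta, delta, k in [(0, 1.0, 0.3, 1.0), (1, 0.6, 0.4, 2.0), (2, 0.0, 1.1, 0.5)]:
    total, elastic, inelastic = partial_wave_cross_sections(l, eta, delta, k)
    print(l, eta, abs(total - (elastic + inelastic)) < 1e-12)
```

Setting $\eta_0 = 0$ in the $l = 0$ wave gives the largest possible inelastic contribution, $\pi/k^2$, which is the starting point for the bound in Problem 2.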


Reference 
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Particle Physics (New York, Interscience) 
3
The quark model


3.1 Introduction 


The quark model arose from the analysis of symmetry patterns observed when
particles were grouped together according to their spin and parity. When the eight
mesons with $J^P = 0^-$ are displayed in a strangeness ($S$) versus isospin ($I_3$) plane,
they form the octet of Fig. 3.1. An identical pattern emerges for the eight vector
mesons with $J^P = 1^-$, also shown in Fig. 3.1. The vector mesons are excited states
of the particles in the $J^P = 0^-$ octet. The symmetry pattern was interpreted as a
generalization of the isospin group SU(2) to the group SU(3), which incorporates
both isospin and strangeness. Gell-Mann and Ne'eman (1964) proposed that the
eight baryons with $J^P = \frac{1}{2}^+$ also belong to an octet of SU(3), thus establishing a
parallelism between meson and baryon states. Finally, many static properties of the
particles exhibit the SU(3) symmetry.

Since the fundamental representation of the group SU(3) is a triplet, it is natural
to try to interpret the hadronic states in the octets as bound states of triplets or of
triplets with antitriplets. If the fundamental fields also carry baryon number, the
product triplet $\otimes$ antitriplet ($3 \otimes \bar 3 = 8 \oplus 1$) would be mesons with zero baryon
number. The product of three triplets ($3 \otimes 3 \otimes 3 = 10 \oplus 8 \oplus 8 \oplus 1$) carries baryon
number and contains octets and a decuplet, as was required by the observed states
of baryons. This is the quark model of Gell-Mann (1964) and Zweig (1964).

The spectroscopy of particles and their SU(3) properties are covered in many 
books, for instance in the references at the end of this chapter, and we shall concen- 
trate on symmetries of the currents, which are more relevant for the electroweak 
theory. 
Figure 3.1. Meson octets with $J^P = 0^-$ and $J^P = 1^-$.


3.2 Current algebra 


The original model contained a triplet of quarks

$$q = \begin{pmatrix} u \\ d \\ s \end{pmatrix} \tag{3.1}$$

with the quantum numbers

Quark   Q/e     I      I_3     B      Y
u        2/3    1/2    1/2    1/3    1/3
d       -1/3    1/2   -1/2    1/3    1/3
s       -1/3    0      0      1/3   -2/3

where $Q$, $I$, $I_3$, $B$, and $Y$ are the charge, isospin, third component of isospin, baryon
number, and hypercharge, respectively. The quantum numbers of the quarks satisfy
the Gell-Mann-Nishijima relation,

$$Q = I_3 + \frac{Y}{2}, \tag{3.2}$$


a rule that was established originally for hadronic states. 
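The table and relation (3.2) can be cross-checked with exact fractions. The sketch below encodes the quantum numbers listed above and verifies (3.2) for each quark; the dictionary layout and the helper names are illustrative choices. As a usage example it also adds up the quark charges for the proton (uud) and neutron (udd).

```python
from fractions import Fraction as F

# Quantum numbers copied from the table above: charge, I_3, baryon number, hypercharge.
QUARKS = {
    "u": {"Q": F(2, 3), "I3": F(1, 2), "B": F(1, 3), "Y": F(1, 3)},
    "d": {"Q": F(-1, 3), "I3": F(-1, 2), "B": F(1, 3), "Y": F(1, 3)},
    "s": {"Q": F(-1, 3), "I3": F(0), "B": F(1, 3), "Y": F(-2, 3)},
}


def satisfies_gell_mann_nishijima(numbers):
    """Check Q = I_3 + Y/2 for one set of quantum numbers."""
    return numbers["Q"] == numbers["I3"] + numbers["Y"] / 2


def total(content, key):
    """Sum an additive quantum number over a quark content string such as 'uud'."""
    return sum(QUARKS[flavor][key] for flavor in content)


for name, numbers in QUARKS.items():
    print(name, satisfies_gell_mann_nishijima(numbers))

print("proton  (uud): Q =", total("uud", "Q"), " B =", total("uud", "B"))
print("neutron (udd): Q =", total("udd", "Q"), " B =", total("udd", "B"))
```

Because $Q$, $I_3$ and $Y$ are all additive, the relation holds for any hadron built from these quarks once it holds for each quark.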

Next we shall rewrite the currents in terms of quark fields and formulate several 
of their properties. This approach is motivated by the fact that several properties of 
the currents and their couplings to hadrons are explained as symmetry properties 
of SU(3) and in many cases they are identical with predictions of the simple quark 
model. In fact, for a long time the quark model was used as a tool for abstracting 
properties and relations, whose validity is more general in field theories. In the 
early days the quark model was supplemented with strong interactions mediated 
by vector mesons in order to verify the validity of the results in theories with 
interactions. Among the regularities are relations between the masses of particles
within a multiplet and regularities of the currents. We describe below conservation 
laws of the currents and outline the algebra of currents. 

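The electromagnetic interaction of leptons is

$$\mathcal L_{int} = e\, j_\mu^{em}(x)\, A^\mu(x), \tag{3.3}$$

with

$$j_\mu^{em}(x) = \bar e(x)\gamma_\mu e(x) + \bar\mu(x)\gamma_\mu \mu(x) + \cdots. \tag{3.4}$$

Similarly, we can construct the electromagnetic current of quarks,

$$\begin{aligned}
j_\mu^{em}(x) &= \sum_i e_{q_i}\,\bar q_i\gamma_\mu q_i \\
&= \frac{2}{3}\,\bar u\gamma_\mu u - \frac{1}{3}\,\bar d\gamma_\mu d - \frac{1}{3}\,\bar s\gamma_\mu s \\
&= \frac{1}{2}\,(\bar u\gamma_\mu u - \bar d\gamma_\mu d) + \frac{1}{6}\,(\bar u\gamma_\mu u + \bar d\gamma_\mu d - 2\,\bar s\gamma_\mu s).
\end{aligned} \tag{3.5}$$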


In the last equation we separated the current into two parts, in order to show explicitly
its SU(3) content. Let $\lambda^a$ be the Gell-Mann matrices for SU(3); then we define vector
and axial currents

$$F_\mu^a(x) = \bar q(x)\,\gamma_\mu\,\frac{\lambda^a}{2}\,q(x), \tag{3.6}$$

$$F_\mu^{5a}(x) = \bar q(x)\,\gamma_\mu\gamma_5\,\frac{\lambda^a}{2}\,q(x), \tag{3.7}$$

with $q$ given in Eq. (3.1). Then the electromagnetic current in (3.5) is

$$j_\mu^{em} = F_\mu^3 + \frac{1}{\sqrt{3}}\,F_\mu^8, \tag{3.8}$$

which reproduces Eq. (2.11).
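The SU(3) content of the electromagnetic current can be checked at the level of flavor matrices: the combination $\lambda^3/2 + \lambda^8/(2\sqrt{3})$ should be the diagonal matrix of quark charges. A minimal sketch, assuming the standard form of the diagonal Gell-Mann matrices $\lambda^3$ and $\lambda^8$ and the flavor ordering $(u, d, s)$:

```python
import numpy as np

# Diagonal Gell-Mann matrices in the basis (u, d, s).
LAMBDA3 = np.diag([1.0, -1.0, 0.0])
LAMBDA8 = np.diag([1.0, 1.0, -2.0]) / np.sqrt(3.0)


def charge_matrix():
    """Flavor matrix of the current F^3 + F^8 / sqrt(3), with F^a built on lambda^a / 2."""
    return LAMBDA3 / 2.0 + (LAMBDA8 / 2.0) / np.sqrt(3.0)


print(np.diag(charge_matrix()))  # quark charges in units of e, ordered (u, d, s)
```

The first term is the isovector part and the second the isoscalar (hypercharge) part, matching the two brackets in the last line of (3.5).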
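With the quark currents it is also convenient to study the symmetries of the
Lagrangian

$$\mathcal L_{quark} = \mathcal L_0 + \mathcal L_{mass}, \tag{3.9}$$

with

$$\mathcal L_0 = i\,\bar q\,\gamma^\mu\partial_\mu\, q \quad\text{and}\quad \mathcal L_{mass} = m_u\,\bar u u + m_d\,\bar d d + m_s\,\bar s s. \tag{3.10}$$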


We now state the invariance properties under global transformations. $\mathcal L_0$ is
invariant under the transformation

$$q \to q' = Uq \quad\text{with}\quad U = e^{i\theta_a\lambda^a/2}, \tag{3.11}$$

where $\theta_a$ are constants, i.e. independent of space and time. Such a transformation
is called global. In proving the invariance under unitary transformations recall that
$e^A e^B = e^{A + B + \frac{1}{2}[A,B]}$, provided that $A$ and $B$ commute with $[A, B]$. The term $\mathcal L_{mass}$ is
not, in general, invariant under the global transformation. It becomes invariant only
when all quark masses are equal:

$$m = m_u = m_d = m_s. \tag{3.12}$$


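A consequence of the symmetry is the conservation of all vector currents. Consider

$$\frac{\partial}{\partial x_\mu}\left\{\bar u(x, p')\,\gamma_\mu\, d(x, p)\right\} = \frac{\partial}{\partial x_\mu}\, j_\mu(x), \tag{3.13}$$

which in momentum space becomes

$$\bar u(x, p')\,[\not{p}' - \not{p}]\,d(x, p) = (m_u - m_d)\,\bar u(x, p')\,d(x, p). \tag{3.14}$$

This current is conserved when the two masses become equal.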
Let us try to repeat this argument for axial transformations:

$$q \to q' = Vq = e^{i\theta_a\frac{\lambda^a}{2}\gamma_5}\,q. \tag{3.15}$$

Now the kinetic term $\mathcal L_0$ is again invariant, but the mass term is not invariant even
when the masses are equal. The Lagrangian is invariant under global $\gamma_5$ transformations
only when all quark masses are zero. In fact the axial current is not conserved
and its divergence is

$$\frac{\partial}{\partial x_\mu}\, j_\mu^5(x) = \bar u(x, p')\,[\not{p}' - \not{p}]\,\gamma_5\, d(x, p) = (m_u + m_d)\,\bar u(x, p')\,\gamma_5\, d(x, p). \tag{3.16}$$

When the Lagrangian is invariant under the axial transformations (3.15), all quark
masses must vanish and the axial current is conserved. The two cases are examples
of Noether's theorem, which states that, for every continuous global transformation
that leaves the Lagrangian invariant, there is a current that is conserved.
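The momentum-space identities behind (3.14) and (3.16) follow from the free Dirac equation alone, and can be confirmed numerically with explicit spinors. The sketch below assumes the Dirac representation of the gamma matrices and metric signature $(+,-,-,-)$; the masses, momenta and seed spinors are arbitrary illustrative numbers, and the helper names are hypothetical.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
Z2 = np.zeros((2, 2), dtype=complex)
PAULI = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]

# Dirac representation: gamma^0, gamma^i, gamma_5.
GAMMA0 = np.block([[I2, Z2], [Z2, -I2]])
GAMMAS = [GAMMA0] + [np.block([[Z2, s], [-s, Z2]]) for s in PAULI]
GAMMA5 = np.block([[Z2, I2], [I2, Z2]])
METRIC = np.diag([1.0, -1.0, -1.0, -1.0])


def slash(p):
    """p-slash = gamma^mu p_mu for contravariant components p^mu."""
    p_lower = METRIC @ p
    return sum(GAMMAS[mu] * p_lower[mu] for mu in range(4))


def on_shell(mass, three_momentum):
    """Four-momentum (E, px, py, pz) with E^2 = m^2 + |p|^2."""
    three_momentum = np.asarray(three_momentum, dtype=float)
    energy = np.sqrt(mass**2 + three_momentum @ three_momentum)
    return np.concatenate(([energy], three_momentum))


def spinor(p, mass, seed):
    """A solution of (p-slash - m) u = 0, obtained by projecting a seed spinor."""
    return (slash(p) + mass * np.eye(4)) @ seed


def bar(u):
    """Dirac adjoint u-bar = u^dagger gamma^0."""
    return u.conj() @ GAMMA0


# Illustrative (hypothetical) masses and momenta in arbitrary units.
m_u, m_d = 0.3, 0.5
p_out = on_shell(m_u, [0.2, -0.1, 0.4])
p_in = on_shell(m_d, [-0.3, 0.25, 0.1])
u_bar = bar(spinor(p_out, m_u, np.array([1.0, 0.5j, -0.3, 0.2], dtype=complex)))
d = spinor(p_in, m_d, np.array([0.4, -1.0, 0.1j, 0.7], dtype=complex))

q_slash = slash(p_out) - slash(p_in)
vector_lhs = u_bar @ q_slash @ d
vector_rhs = (m_u - m_d) * (u_bar @ d)
axial_lhs = u_bar @ q_slash @ GAMMA5 @ d
axial_rhs = (m_u + m_d) * (u_bar @ GAMMA5 @ d)

print(np.isclose(vector_lhs, vector_rhs), np.isclose(axial_lhs, axial_rhs))
```

The vector divergence is proportional to the mass difference and the axial divergence to the mass sum, which is why equal masses suffice for vector-current conservation while axial conservation needs vanishing masses.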

Relations of the second class are abstracted from the quark model and establish 
equal-time commutation relations of currents. 

In quantum field theory the quark fields satisfy the following equal-time canonical
anticommutation relations:

$$\begin{aligned}
\{q_\tau(x),\, q_{\tau'}^\dagger(x')\}_{x_0 = x_0'} &= \delta_{\tau\tau'}\,\delta^3(\mathbf x - \mathbf x'), \\
\{q_\tau(x),\, q_{\tau'}(x')\}_{x_0 = x_0'} &= \{q_\tau^\dagger(x),\, q_{\tau'}^\dagger(x')\}_{x_0 = x_0'} = 0,
\end{aligned} \tag{3.17}$$

where $\tau$ and $\tau'$ run from 1 to 12, i.e. there are three flavors and to each of them
there correspond four spinor components. One can derive equal-time commutation
relations for the SU(3) currents using the identity

$$[AB, CD] = -AC\{D, B\} + A\{C, B\}D - C\{D, A\}B + \{C, A\}DB. \tag{3.18}$$

The final result, outlined in Problem 2, is

$$[F_0^a(x),\, F_0^b(x')]_{x_0 = x_0'} = i f^{abc}\,F_0^c(x)\,\delta^3(\mathbf x - \mathbf x'). \tag{3.19}$$

On integrating this equation over three-dimensional space we arrive at

$$[Q^a(x_0),\, F_0^b(x)] = i f^{abc}\,F_0^c(x), \tag{3.20}$$

where $Q^a(x_0)$ is the charge corresponding to the vector current, defined by

$$Q^a(x_0) = \int d^3x\; F_0^a(x). \tag{3.21}$$


It is now straightforward to derive from (3.20) the commutation relation for the
charges:

$$[Q^a, Q^b] = i f^{abc}\,Q^c. \tag{3.22}$$

Similarly, we can repeat the above steps for the axial current to obtain

$$[Q^a, Q_5^b] = i f^{abc}\,Q_5^c, \tag{3.23}$$

$$[Q_5^a, Q_5^b] = i f^{abc}\,Q^c. \tag{3.24}$$

We see that the vector and axial charges form an algebra that closes under commutation
relations. If we define left- and right-handed charges

$$Q_{L,R}^a = \frac{1}{2}\,(Q^a \mp Q_5^a), \tag{3.25}$$

they also satisfy the algebra

$$[Q_L^a, Q_R^b] = 0, \qquad [Q_L^a, Q_L^b] = i f^{abc}\,Q_L^c, \qquad [Q_R^a, Q_R^b] = i f^{abc}\,Q_R^c. \tag{3.26}$$


It says that the left-handed sector does not communicate with the right-handed
sector. Thus each sector by itself forms an SU(3) algebra. The group now is
SU(3)$_L$ $\times$ SU(3)$_R$, known as the chiral group. The theory based on the chiral group
and the approximation that the u and d quark masses are very small, relative to
those of the other quarks, is known as chiral theory. The chiral theory can explain
many of the regularities observed at small masses and momenta. We shall have the
opportunity to remark on the implications of such a theory in Sections 5.2 and 15.6.
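The decoupling of the left- and right-handed sectors can be illustrated with finite matrices. The sketch below does not use field operators; it assumes a single-particle matrix representation in which $Q^a \to (\lambda^a/2) \otimes 1$ and $Q_5^a \to (\lambda^a/2) \otimes \gamma_5$, which obeys the same commutators as (3.22)-(3.24) because $\gamma_5^2 = 1$. The structure constants are extracted from the Gell-Mann matrices via $f^{abc} = \mathrm{Tr}([\lambda^a, \lambda^b]\lambda^c)/4i$. Function names are illustrative.

```python
import numpy as np


def gell_mann():
    """The eight Gell-Mann matrices as an array of shape (8, 3, 3)."""
    lam = np.zeros((8, 3, 3), dtype=complex)
    lam[0][0, 1] = lam[0][1, 0] = 1
    lam[1][0, 1], lam[1][1, 0] = -1j, 1j
    lam[2][0, 0], lam[2][1, 1] = 1, -1
    lam[3][0, 2] = lam[3][2, 0] = 1
    lam[4][0, 2], lam[4][2, 0] = -1j, 1j
    lam[5][1, 2] = lam[5][2, 1] = 1
    lam[6][1, 2], lam[6][2, 1] = -1j, 1j
    lam[7] = np.diag([1, 1, -2]) / np.sqrt(3)
    return lam


def structure_constants(lam):
    """f^{abc} from Tr([lambda^a, lambda^b] lambda^c) = 4 i f^{abc}."""
    f = np.zeros((8, 8, 8))
    for a in range(8):
        for b in range(8):
            comm = lam[a] @ lam[b] - lam[b] @ lam[a]
            for c in range(8):
                f[a, b, c] = (np.trace(comm @ lam[c]) / 4j).real
    return f


GAMMA5 = np.diag([-1.0, -1.0, 1.0, 1.0])  # gamma_5 in a chiral basis


def chiral_charges(lam):
    """Matrix stand-ins for Q_L^a and Q_R^a of Eq. (3.25)."""
    q_vec = np.array([np.kron(t, np.eye(4)) for t in lam / 2])
    q_axial = np.array([np.kron(t, GAMMA5) for t in lam / 2])
    return (q_vec - q_axial) / 2, (q_vec + q_axial) / 2


def check_chiral_algebra():
    """True if the matrices satisfy all three relations of Eq. (3.26)."""
    lam = gell_mann()
    f = structure_constants(lam)
    q_l, q_r = chiral_charges(lam)
    for a in range(8):
        for b in range(8):
            ll = q_l[a] @ q_l[b] - q_l[b] @ q_l[a]
            rr = q_r[a] @ q_r[b] - q_r[b] @ q_r[a]
            lr = q_l[a] @ q_r[b] - q_r[b] @ q_l[a]
            if not np.allclose(ll, 1j * np.einsum("c,cij->ij", f[a, b], q_l)):
                return False
            if not np.allclose(rr, 1j * np.einsum("c,cij->ij", f[a, b], q_r)):
                return False
            if not np.allclose(lr, 0):
                return False
    return True


print(check_chiral_algebra())
```

The left-right commutator vanishes because the projectors $(1 \mp \gamma_5)/2$ are orthogonal, which is the matrix counterpart of the statement that the two sectors do not communicate.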
We can also express the weak hadronic current given by

$$j_\mu^h = \bar u\,\gamma_\mu(1 - \gamma_5)\,d\,\cos\theta_c + \bar u\,\gamma_\mu(1 - \gamma_5)\,s\,\sin\theta_c \tag{3.27}$$

in terms of the octet currents (3.6) and (3.7). Defining

$$V_\mu^i = F_\mu^i, \qquad A_\mu^i = F_\mu^{5i}, \tag{3.28}$$

one obtains

$$j_\mu^h = [(V_\mu^1 + iV_\mu^2) - (A_\mu^1 + iA_\mu^2)]\cos\theta_c + [(V_\mu^4 + iV_\mu^5) - (A_\mu^4 + iA_\mu^5)]\sin\theta_c \tag{3.29}$$

and we recover the $\Delta S = 0$ part of (2.17) and the $\Delta S = 1$ part of (2.18).
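The flavor structure of (3.29) can be read off from the matrices $(\lambda^1 + i\lambda^2)/2$ and $(\lambda^4 + i\lambda^5)/2$, which connect $d \to u$ and $s \to u$ respectively. A short sketch in the basis $(u, d, s)$; the value used for the Cabibbo angle ($\sin\theta_c \approx 0.22$) is an approximate, assumed number and the function name is illustrative.

```python
import numpy as np

LAMBDA1 = np.array([[0, 1, 0], [1, 0, 0], [0, 0, 0]], dtype=complex)
LAMBDA2 = np.array([[0, -1j, 0], [1j, 0, 0], [0, 0, 0]], dtype=complex)
LAMBDA4 = np.array([[0, 0, 1], [0, 0, 0], [1, 0, 0]], dtype=complex)
LAMBDA5 = np.array([[0, 0, -1j], [0, 0, 0], [1j, 0, 0]], dtype=complex)


def weak_flavor_matrix(theta_c):
    """Flavor matrix M such that q-bar M q reproduces the quark content of Eq. (3.27)."""
    d_to_u = (LAMBDA1 + 1j * LAMBDA2) / 2   # single entry at (u, d)
    s_to_u = (LAMBDA4 + 1j * LAMBDA5) / 2   # single entry at (u, s)
    return np.cos(theta_c) * d_to_u + np.sin(theta_c) * s_to_u


theta_c = np.arcsin(0.22)  # assumed approximate Cabibbo angle
print(np.round(weak_flavor_matrix(theta_c).real, 3))
```

Only the first row is populated, with entries $\cos\theta_c$ and $\sin\theta_c$ in the $d$ and $s$ columns, so the current raises the charge by one unit in both the $\Delta S = 0$ and $\Delta S = 1$ pieces.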

Expressing the hadronic currents in terms of quark fields sets them in one-to- 
one correspondence with the leptonic currents. The equal-time commutators give 
non-linear relations between observables, thus determining their relative strengths. 
Prominent among them are several sum rules that are valid at small and large 
momentum transfers. 


3.3 Quantum chromodynamics 


In spite of its successes, the quark model was received with a lot of skepticism 
because there was no experimental evidence for particles with fractional charges. 
To some authors this remained a mystery; to others the quarks remained a mnemonic 
for deriving useful rules. An additional objection concerned the fact that there was 
no theory describing the strong interactions among quarks. The attitude changed 
in the late sixties when inelastic electron—nucleon-scattering experiments provided 
evidence for point-like constituents, partons, within hadrons. Furthermore, corre- 
lations between electron- and neutrino-induced reactions provided evidence that 
the partons carried the quark quantum numbers. These topics will be studied in 
detail in Chapters 10 and 11. The final result was the formulation of a theory for
the strong interactions whose fundamental fields are the quarks and the vector
mesons known as gluons.

The theory of strong interactions is known as quantum chromodynamics or, in 
short, QCD. There are strong indications that each quark carries an additional quan- 
tum number called color; hence the name of the theory chromodynamics (color = 
chroma). The choice of names of the colors as red, white, and blue, or another 
triplet of names, is arbitrary but the fact that they are three in number is important. 
The quarks interact with each other by the exchange of vector bosons that change 
the colors of the quarks (Gross and Wilczek, 1973; Politzer, 1973). 

We include in this section a few introductory remarks and discuss topics related 
to QCD in various sections of the book. 


Figure 3.2. A gluon-fermion vertex, with vertex factor $i g_s \gamma_\mu \lambda^a/2$.


The theory of strong interactions is in many respects similar to QED. We write 
a quark of a specific flavor as a triplet of color SU(3): 


$$q(x) = \begin{pmatrix} q_r \\ q_w \\ q_b \end{pmatrix}.$$


The theory also contains eight vector mesons, the gluons, coupled to quarks.
There is again a vector vertex, with the new coupling constant $g_s$ for the strong
interactions, and a $\lambda^a$ matrix acting on the quarks. The effective coupling constant
for the strong interactions,

$$\alpha_s(p) = \frac{g_s^2(p)}{4\pi},$$

is now large and calculations with the exchange of a single gluon are neither accurate
nor useful. One considers the cumulative effect from the exchange of many gluons,
which modify the coupling constant, making it a function of momentum carried by
the gluon.

The strong coupling constant has a remarkable property. At small momenta it 
is very large, binding the quarks into hadrons, so quarks cannot be separated as 
asymptotic particles. At large momenta of the gluons, the strong coupling constant 
becomes small, making perturbative calculations possible. As a consequence there 
are two types of calculations in QCD. One of them involves large momenta, for 
which perturbative summations of many gluons are possible. In a second class of 
calculations, numerical simulations of QCD replace continuous space-time by a 
finite but large four-dimensional lattice for space and time. Sophisticated computer 
programs have been written for handling gluon and quark fields on the lattice. 
These are non-perturbative calculations that should produce, among other results, 
confinement. 
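The qualitative behaviour described above, a coupling that is large at small momenta and small at large momenta, can be illustrated with the standard one-loop expression $\alpha_s(Q^2) = 12\pi / [(33 - 2n_f)\ln(Q^2/\Lambda^2)]$. This formula is not given in this section; it is quoted here as the usual leading-order result, and the values $\Lambda = 0.2$ GeV and $n_f = 5$ are assumptions chosen for illustration only (in practice $n_f$ changes across quark thresholds).

```python
import math


def alpha_s_one_loop(q_gev, lambda_qcd_gev=0.2, n_flavors=5):
    """Leading-order running coupling; valid only for Q well above Lambda."""
    if q_gev <= lambda_qcd_gev:
        raise ValueError("one-loop formula is meaningless for Q <= Lambda")
    log_term = math.log(q_gev**2 / lambda_qcd_gev**2)
    return 12.0 * math.pi / ((33.0 - 2.0 * n_flavors) * log_term)


for q in (1.0, 2.0, 10.0, 91.2):
    print(f"Q = {q:6.1f} GeV   alpha_s ~ {alpha_s_one_loop(q):.3f}")
```

The logarithmic fall-off with $Q$ is what makes perturbative calculations possible at large momenta, while the growth towards $Q \sim \Lambda$ signals the regime where lattice methods are needed.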

Throughout this book we shall study decays and reactions that involve both 
strong and weak interactions. The weak interactions of hadrons will be expressed 
in terms of the quark substructure by writing the currents in terms of quark fields and 
estimating or calculating matrix elements of quark operators for transitions between 
hadronic states. The success of these methods varies from process to process. This 
is a still developing field of research, as will become evident in several sections of 
this book. 
Problems for Chapter 3


1. The lowest-lying baryon states are built with three quarks with $L = 0$. There are ten states
with $J^P = \frac{3}{2}^+$. To this decuplet belongs $\Delta^{++}$ (uuu). Construct the wave function of $\Delta^{++}$
with space, spin, and color contributions so that it obeys Fermi statistics. Finally, argue
that color is necessary in order for the Pauli principle to be preserved. See Kokkedee
(1969).

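2. The weak vector current builds, together with the electromagnetic current, an algebra.
For a better understanding we consider the SU(2) algebra. The generators of the group
SU(2) are the matrices $\tau^1$, $\tau^2$, and $\tau^3$, with the following property:

$$\left[\frac{\tau^i}{2}, \frac{\tau^j}{2}\right] = i\epsilon^{ijk}\,\frac{\tau^k}{2}.$$

We now define the currents

$$J_\mu^a(x) = \bar q(x)\,\frac{\tau^a}{2}\,\gamma_\mu\, q(x),$$

which satisfy an algebra.
The fermion fields obey the canonical quantization

$$\{q_\tau(x),\, q_{\tau'}^\dagger(y)\}_{x_0 = y_0} = \delta_{\tau\tau'}\,\delta^3(\mathbf x - \mathbf y)$$

and

$$\{q_\tau(x),\, q_{\tau'}(y)\}_{x_0 = y_0} = \{q_\tau^\dagger(x),\, q_{\tau'}^\dagger(y)\}_{x_0 = y_0} = 0.$$

(a) Show the following relation:

$$[\Gamma_\alpha\tau^i,\, \Gamma_\beta\tau^j] = \frac{1}{2}\,\{\Gamma_\alpha, \Gamma_\beta\}\,[\tau^i, \tau^j] + \frac{1}{2}\,[\Gamma_\alpha, \Gamma_\beta]\,\{\tau^i, \tau^j\},$$

where $\Gamma_\alpha$ and $\Gamma_\beta$ are arbitrary Dirac matrices.

(b) Using the identity

$$[AB, CD] = -AC\{B, D\} + A\{C, B\}D - C\{D, A\}B + \{C, A\}DB,$$

show that

$$[q^\dagger(x)\,\Gamma_\alpha\tau^i\, q(x),\; q^\dagger(y)\,\Gamma_\beta\tau^j\, q(y)]_{x_0 = y_0} = q^\dagger(x)\,[\Gamma_\alpha\tau^i,\, \Gamma_\beta\tau^j]\,q(x)\;\delta^3(\mathbf x - \mathbf y).$$

(c) It follows now that

$$\left[\bar q(x)\,\gamma_\mu\frac{\tau^a}{2}\,q(x),\; \bar q(y)\,\gamma_0\frac{\tau^b}{2}\,q(y)\right]_{x_0 = y_0} = i\epsilon^{abc}\;\bar q(x)\,\frac{\tau^c}{2}\,\gamma_\mu\, q(x)\;\delta^3(\mathbf x - \mathbf y).$$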
Part II


Field theories with global or local symmetries 


4 
Yang—Mills theories 


4.1 The Yang-Mills field 


The successful and simple theory which unifies the weak and electromagnetic 
interactions is based on the group SU(2) x U(1). We develop the theory in several 
steps. First we describe, in this chapter, the main features of a gauge theory. Then we 
will describe a theory containing only electrons and the corresponding neutrinos. 
Finally, the theory is extended to incorporate hadrons. 

The structure of a Yang—Mills theory is almost completely determined by the re- 
quirement that the internal symmetry transformations of the fields can be carried out 
independently at different space-time points. In other words, the theory is invariant 
under local transformations. Let $\Psi$ be a multiplet of $n$ Dirac fields. The multiplets
belong to representations of the group SU(N). We define a transformation of the
fermion fields by

$$\Psi \to \Psi' = U\Psi, \tag{4.1}$$

with $U$ a unitary matrix. We represent $U$ by

$$U = e^{\frac{i}{2}\alpha_j\lambda_j}, \tag{4.2}$$

with $j = 1, 2, \ldots, N^2 - 1$, where the $\lambda_j$ are the generators of the group with $\lambda_j = \lambda_j^\dagger$
and the $\alpha_j$ are real. The generators are familiar in simple cases. When the fermions
belong to the fundamental representation of SU(2), the $\lambda_j$ are the Pauli matrices;
for SU(3) they are the Gell-Mann matrices. We distinguish two cases:


(i) when all $\alpha_j$ are constant, we call it a global transformation;
(ii) when the $\alpha_j = \alpha_j(x)$ are functions of $x_\mu$, we call it a local or gauge transformation.


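The free Dirac Lagrangian

$$\mathcal L = i\,\bar\Psi(x)\,\gamma_\mu\partial^\mu\,\Psi(x) = i\,\bar\Psi(x)\,\not{\partial}\,\Psi(x) \tag{4.3}$$

is invariant under global transformations.
If we allow $\alpha_j$ to be a function of $x$, then (4.3) is no longer invariant. In fact, the
Lagrangian transforms into

$$\mathcal L \to \mathcal L' = \bar\Psi\, i\gamma_\mu\left[\partial^\mu + U^{-1}(\partial^\mu U)\right]\Psi, \tag{4.4}$$

where the additional term is determined by the derivatives $\partial\alpha_j(x)/\partial x_\mu$. Following
Yang and Mills, we introduce a set of vector fields $B_j^\mu(x)$ and couple them to the
currents as follows:

$$\mathcal L = \bar\Psi\, i\gamma_\mu\left[\partial^\mu + A_j B_j^\mu\right]\Psi + \mathcal L_B, \tag{4.5}$$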


where $A_j$ is a set of matrices still to be determined and $\mathcal L_B$ is a function of the
$B_j^\mu$ terms only. Each vector field is characterized by a Lorentz index $\mu$ and an
internal symmetry index $j$. We now demand that $\mathcal L$ remains invariant under the
transformations (4.2) with $\alpha_j$ a function of $x$; this will require that $B_j^\mu$ transforms
in such a way as to cancel out the additional term in (4.4). Let $B_j'^\mu$ be the transformed
vector field. Then, for $\mathcal L$ to remain invariant,

$$U^{-1}\frac{\partial U}{\partial x_\mu} + U^{-1}A_j U\, B_j'^\mu = A_j B_j^\mu \tag{4.6}$$

must hold (see Problem 1). Since the $\lambda_j$ terms form a complete set of $N \times N$
traceless matrices, we can attempt to write

$$A_k = \frac{i}{2}\,g\,\lambda_k; \tag{4.7}$$

the imaginary $i$ is there because the $\lambda_k$ terms and the $B$ terms are Hermitian.
Considering infinitesimal transformations,

$$U = 1 + \frac{i}{2}\,\alpha_j\lambda_j, \tag{4.8}$$

with

$$\left[\frac{\lambda_i}{2}, \frac{\lambda_j}{2}\right] = i f_{ijk}\,\frac{\lambda_k}{2}, \qquad \mathrm{Tr}[\lambda_i\lambda_j] = 2\delta_{ij}, \tag{4.9}$$

and $f_{ijk}$ are the structure constants of the group. For the infinitesimal transformation
we can solve for $B_i'^\mu$ in (4.6). A convenient method is to rewrite (4.6) as

$$g\,\lambda_j B_j^\mu = U^{-1}\,g\,\lambda_j\,U\,B_j'^\mu + \lambda_j\,\frac{\partial\alpha_j}{\partial x_\mu} \tag{4.10}$$

and then expand the unitary matrices to first order in $\alpha_i$ and use the relation in (4.9)
to obtain

$$B_i'^\mu = B_i^\mu - f_{ijk}\,\alpha_j B_k^\mu - \frac{1}{g}\,\frac{\partial\alpha_i}{\partial x_\mu}. \tag{4.11}$$
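The first-order transformation law of the gauge field can be tested numerically for SU(2), where the $\lambda_j$ are the Pauli matrices and $f_{ijk} = \epsilon_{ijk}$. The sketch below fixes its own conventions, which are assumptions stated here explicitly: $\Psi' = U\Psi$ with $U = \exp(\frac{i}{2}\alpha_j\sigma_j)$ and $A_j = \frac{i}{2}g\sigma_j$. Under these, exact invariance requires $A_jB_j' = U A_jB_j U^{-1} - (\partial U)U^{-1}$, and to first order in $\alpha$ this gives $B_i' = B_i - \epsilon_{ijk}\alpha_jB_k - \frac{1}{g}\partial\alpha_i$. Only one space-time direction is considered, the derivative of $U$ is taken by finite differences, and all numerical values are arbitrary.

```python
import numpy as np

SIGMA = np.array(
    [
        [[0, 1], [1, 0]],
        [[0, -1j], [1j, 0]],
        [[1, 0], [0, -1]],
    ],
    dtype=complex,
)


def levi_civita():
    """Totally antisymmetric tensor eps_ijk, the SU(2) structure constants."""
    eps = np.zeros((3, 3, 3))
    eps[0, 1, 2] = eps[1, 2, 0] = eps[2, 0, 1] = 1.0
    eps[0, 2, 1] = eps[2, 1, 0] = eps[1, 0, 2] = -1.0
    return eps


def su2_element(alpha):
    """U = exp((i/2) alpha_j sigma_j), using the closed form for SU(2)."""
    norm = np.linalg.norm(alpha)
    if norm == 0.0:
        return np.eye(2, dtype=complex)
    n_dot_sigma = np.einsum("j,jab->ab", alpha / norm, SIGMA)
    return np.cos(norm / 2) * np.eye(2) + 1j * np.sin(norm / 2) * n_dot_sigma


def exact_transformed_field(b_field, alpha_of_x, x, g, step=1e-4):
    """Solve A.B' = U A.B U^-1 - (dU/dx) U^-1 for the components B'_i."""
    u = su2_element(alpha_of_x(x))
    du = (su2_element(alpha_of_x(x + step)) - su2_element(alpha_of_x(x - step))) / (2 * step)
    u_inv = u.conj().T
    a_dot_b = 0.5j * g * np.einsum("j,jab->ab", b_field, SIGMA)
    a_dot_b_new = u @ a_dot_b @ u_inv - du @ u_inv
    # Tr(sigma_i sigma_j) = 2 delta_ij, hence Tr(sigma_i A.B') = i g B'_i.
    return np.array([np.trace(SIGMA[i] @ a_dot_b_new) / (1j * g) for i in range(3)]).real


def first_order_transformed_field(b_field, alpha, dalpha, g):
    """B'_i = B_i - eps_ijk alpha_j B_k - (1/g) d(alpha_i)/dx."""
    return b_field - np.einsum("ijk,j,k->i", levi_civita(), alpha, b_field) - dalpha / g


# Arbitrary illustrative inputs: small alpha(x) = a + b x, one field component B_i.
a = np.array([1.0e-4, -2.0e-4, 1.5e-4])
b = np.array([0.5e-4, 0.8e-4, -0.3e-4])
b_field = np.array([0.7, -1.1, 0.4])
g, x0 = 0.65, 0.3


def alpha_of_x(x):
    return a + b * x


exact = exact_transformed_field(b_field, alpha_of_x, x0, g)
approx = first_order_transformed_field(b_field, alpha_of_x(x0), b, g)
print("max deviation:", np.max(np.abs(exact - approx)))
```

The deviation is of second order in $\alpha$, as expected for a first-order formula. With the opposite sign in the exponent of $U$ both correction terms change sign, so the signs in (4.11) are tied to the convention chosen for (4.2).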
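It is convenient to introduce a covariant derivative,

$$D^\mu = \partial^\mu + \frac{i}{2}\,g\,\lambda_j B_j^\mu, \tag{4.12}$$

and rewrite the fermion part in (4.5) as

$$\mathcal L_F = \bar\Psi\, i\gamma_\mu D^\mu\,\Psi. \tag{4.13}$$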

Covariant derivatives are useful in generating gauge-invariant Lagrangians. A 
Lagrangian invariant under global transformations becomes locally gauge-invariant 
when all ordinary derivatives are replaced by covariant derivatives. In quantum elec- 
trodynamics this replacement is the well-known minimal-substitution law. 

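Next we must construct $\mathcal L_B$. It must be Lorentz-invariant and invariant under
$B \to B'$. It must also contain the kinetic term of the $B_\mu$ fields. In analogy to the
procedure of obtaining gauge-invariant field strengths in electrodynamics, we define

$$F_i^{\mu\nu} = \partial^\nu B_i^\mu - \partial^\mu B_i^\nu + g\, f_{ijk}\,B_j^\mu B_k^\nu. \tag{4.14}$$

If we introduce the vector notation

$$\mathbf B^\mu = (B_1^\mu, B_2^\mu, \ldots, B_k^\mu), \tag{4.15}$$

where $k = N^2 - 1$ and

$$(\mathbf A \times \mathbf B)_i = f_{ijk}\,A_j B_k, \tag{4.16}$$

we can write (4.11) and (4.14) as

$$\mathbf B'^\mu = \mathbf B^\mu - \boldsymbol\alpha \times \mathbf B^\mu - \frac{1}{g}\,\partial^\mu\boldsymbol\alpha, \qquad \mathbf F^{\mu\nu} = \partial^\nu\mathbf B^\mu - \partial^\mu\mathbf B^\nu + g\,\mathbf B^\mu \times \mathbf B^\nu. \tag{4.17}$$

The last term in (4.17) does not occur in electrodynamics and is introduced to
assure that $\mathbf F^{\mu\nu}$ transforms as a vector under gauge transformations. A reason for
introducing a generalized $F_{\mu\nu}$ is given in Problem 1; in the same problem we
discuss the gauge invariance of $\mathcal L_B$. We can now build a scalar Lagrange function
for the $\mathbf B^\mu$ fields,

$$\mathcal L_B = -\frac{1}{4}\,\mathbf F^{\mu\nu}\cdot\mathbf F_{\mu\nu} = -\frac{1}{4}\,F_i^{\mu\nu}F_{i\mu\nu}. \tag{4.18}$$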


A theory with $\mathcal L_B$ alone is called a pure Yang-Mills theory.
For Lagrangians invariant under symmetries, we can also define currents of the
original Lagrangian, which are given by

$$J_j^\mu(x) = -i\,\frac{\partial\mathcal L}{\partial(\partial_\mu\Psi)}\,\frac{\lambda_j}{2}\,\Psi = \bar\Psi(x)\,\gamma^\mu\,\frac{\lambda_j}{2}\,\Psi(x). \tag{4.19}$$

The invariance of the theory implies that the currents are conserved. We note that
these are the same currents as those we introduced in Chapter 3.
To sum up, we constructed a theory that is invariant under gauge transformations.
The complete Lagrangian is

$$\mathcal L = \mathcal L_F + \mathcal L_B.$$


We found that the invariance requirements are fulfilled by introducing vector fields 
coupled to conserved currents. 

Such a theory is a candidate for particle physics. It describes the interaction of 
massless fermions with massless gauge bosons. It possesses a symmetry that can 
be SU(2), SU(3), or a larger unitary group. The case of SU(3) is, in fact, realized in 
Nature as the theory of strong interactions. There the vector bosons are the gluons 
coupled to quarks and the symmetry is the SU(3)-color. The color symmetry remains 
unbroken. The electroweak theory is more complicated because it contains masses 
for the quarks and the gauge bosons. It is a broken symmetry, to be developed in 
Chapters 5-7. 


4.2 Gauge invariance in scalar electrodynamics 


The electrodynamic field is described by the four-vector

$$A_\mu(x) = (\Phi(x), \mathbf A(x)), \tag{4.20}$$

whose components are the standard scalar and vector potentials. The electric and
magnetic fields are now determined by

$$\mathbf E = -\nabla A_0 - \frac{\partial\mathbf A}{\partial t}, \qquad \mathbf B = \nabla \times \mathbf A. \tag{4.21}$$

However, to one set of fields $(\mathbf E, \mathbf B)$ there correspond many potentials $A_\mu$. The
primed potentials obtained by the gauge transformation

$$A'_\mu(x) = A_\mu(x) + \frac{\partial\Lambda(x)}{\partial x^\mu}, \tag{4.22}$$

with $\Lambda(x)$ an arbitrary scalar function, give the same $\mathbf E$ and $\mathbf B$. If someone solves
a problem with $A_\mu(x)$ and somebody else does it with $A'_\mu(x)$, both should get the
same physical result. In general, only those quantities which are invariant under
gauge transformations have physical meaning. Gauge invariance has far-reaching
implications for the theories, as we discuss in the following chapters, and clever
choices of gauge lead to substantial simplifications of problems.

Here we use gauge invariance to discuss the degrees of freedom for the
electromagnetic field. For the pure electromagnetic case

$$\mathcal L = -\frac{1}{4}\,F_{\mu\nu}F^{\mu\nu}, \tag{4.23}$$

with

$$F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu. \tag{4.24}$$

The equation of motion

$$\partial_\mu F^{\mu\nu} = 0, \tag{4.25}$$

when written in terms of $A_\mu$, becomes

$$\partial_\mu\partial^\mu A^\nu - \partial^\nu\partial_\mu A^\mu = 0. \tag{4.26}$$

It is well known that the $\mathbf E$ and the $\mathbf B$ fields satisfy a wave equation, but $A_\mu$ does
not. In order to recover a wave equation from (4.26), we impose the condition

$$\partial_\mu A^\mu = 0 \quad \text{(Lorentz gauge)}. \tag{4.27}$$

We used the gauge freedom to obtain this result, but still we did not exhaust all
possible gauge transformations, because any gauge function $\chi(x)$ that satisfies

$$\partial_\mu\partial^\mu\chi(x) = 0 \tag{4.28}$$

is still consistent with (4.27). We take advantage of this freedom in order to show
that a photon has only two degrees of freedom.
A free photon is represented by a plane wave

$$A_\mu(x) = \epsilon_\mu\, e^{-ik\cdot x}, \tag{4.29}$$
where $\epsilon_\mu$ is called the polarization vector. By substituting (4.29) into (4.26), we
find $k^2 = 0$, or $m = 0$, and from (4.27) we find

$$k_\mu\epsilon^\mu = 0 \quad \text{(Lorentz gauge)}. \tag{4.30}$$

We choose a coordinate system with the z-axis along $\mathbf k$ and decompose $\epsilon_\mu$ into
longitudinal and transverse parts:

$$A_\mu(x) = (\epsilon_\mu^L + \epsilon_\mu^T)\,e^{-ik\cdot x}. \tag{4.31}$$

From (4.30) we conclude that $\epsilon_\mu^L$ is proportional to $k_\mu$. Instead of the electromagnetic
field $A_\mu(x)$, we can choose another one given through a gauge transformation with

$$\chi(x) = i c\, e^{-ik\cdot x}, \qquad c = \text{constant}. \tag{4.32}$$

The new field is

$$A'_\mu(x) = (\epsilon_\mu^L + \epsilon_\mu^T)\,e^{-ik\cdot x} + c\,k_\mu\, e^{-ik\cdot x}. \tag{4.33}$$

By an appropriate choice of the constant $c$, we can eliminate $\epsilon_\mu^L$, i.e. we can gauge
away the longitudinal degrees of freedom. Therefore a free photon has only two
degrees of freedom.
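The counting argument can be made concrete with explicit components. The sketch below assumes a photon moving along the z-axis, $k^\mu = (\omega, 0, 0, \omega)$, metric signature $(+,-,-,-)$, and contravariant components throughout; the numerical values and function names are illustrative. The Lorentz condition forces $\epsilon^0 = \epsilon^3$, and the residual shift $\epsilon \to \epsilon + c\,k$ then removes both at once.

```python
import numpy as np

METRIC = np.diag([1.0, -1.0, -1.0, -1.0])


def minkowski_dot(a, b):
    """a.b = a^mu g_{mu nu} b^nu."""
    return a @ METRIC @ b


def gauge_away_longitudinal(k, eps):
    """Return (c, eps') with eps' = eps + c k and vanishing time component."""
    c = -eps[0] / k[0]
    return c, eps + c * k


omega = 3.0
k = np.array([omega, 0.0, 0.0, omega])       # light-like: k.k = 0
eps = np.array([0.7, 1.0, -2.0, 0.7])        # satisfies k.eps = 0 since eps^0 = eps^3

c, eps_new = gauge_away_longitudinal(k, eps)
print("k.k =", minkowski_dot(k, k), " k.eps =", minkowski_dot(k, eps))
print("c =", c, " eps' =", eps_new)
```

Only the two transverse components survive, and the shifted vector still satisfies $k\cdot\epsilon' = 0$ because $k^2 = 0$; this last step is exactly what fails for a massive field.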

The argument fails for a massive $A_\mu(x)$ field. The addition of a mass term
$\frac{1}{2}\mu^2 A_\mu A^\mu$ to Eq. (4.23) breaks the gauge invariance of the theory. In this case the
equation of motion

$$\partial_\mu F^{\mu\nu} + \mu^2 A^\nu = 0 \tag{4.34}$$

implies

$$\mu^2\,\partial_\nu A^\nu = 0. \tag{4.35}$$

For $\mu^2 \neq 0$, $A_\nu$ satisfies the Lorentz condition, which again eliminates one degree
of freedom. But now we cannot repeat the steps between Eqs. (4.28) and (4.33).
Therefore a massive field has three degrees of freedom.
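The step from (4.34) to (4.35) is a one-line consequence of the antisymmetry of $F^{\mu\nu}$. Taking the divergence of the equation of motion:

```latex
\partial_\nu\left(\partial_\mu F^{\mu\nu} + \mu^2 A^\nu\right)
  = \underbrace{\partial_\nu\partial_\mu F^{\mu\nu}}_{=\,0\ \text{since}\ F^{\mu\nu} = -F^{\nu\mu}}
    + \mu^2\,\partial_\nu A^\nu = 0
\quad\Longrightarrow\quad
\mu^2\,\partial_\nu A^\nu = 0 .
```

The Lorentz condition is therefore not a gauge choice here but a consequence of the field equations, which leaves $4 - 1 = 3$ independent components.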


Next we study the interaction of a photon with a charged scalar field: scalar
electrodynamics. The Lagrangian is

$$\mathcal L = -\frac{1}{4}\,F_{\mu\nu}F^{\mu\nu} + (D^\mu\phi)^*(D_\mu\phi) - V(\phi). \tag{4.36}$$

Here $D^\mu = (\partial^\mu + ieA^\mu)$ is the covariant derivative, in agreement with the rule of
replacing ordinary derivatives with covariant ones. We represent the field as

$$\phi(x) = \frac{1}{\sqrt{2}}\,(\phi_1(x) + i\phi_2(x)) \tag{4.37}$$

and introduce the potential

$$V(\phi) = \mu^2\,\phi^*\phi + \lambda\,(\phi^*\phi)^2. \tag{4.38}$$

This Lagrangian is invariant under the local transformation

$$\begin{aligned}
A^\mu(x) &\to A^\mu(x) + \partial^\mu\omega(x), \\
\phi(x) &\to e^{-ie\omega(x)}\,\phi(x), \\
\phi^*(x) &\to e^{ie\omega(x)}\,\phi^*(x),
\end{aligned} \tag{4.39}$$

where $\omega(x)$ is an arbitrary real function. The photon is again massless and
carries two independent degrees of freedom. This follows from the same arguments
as in the free-photon case. First we can go to the Lorentz gauge, which
again simplifies the equations of motion both for $A_\mu(x)$ and for the scalar field
$\phi(x)$. Then, since the photon is again massless, we can introduce a new gauge
transformation satisfying (4.28) and eliminate the longitudinal degrees of freedom.
In gauge theories, masses for the gauge bosons are introduced, not through the 
ad-hoc procedure of the previous paragraph, but through spontaneous breaking of 
the symmetry. We study this topic in Chapter 6 and return to scalar electrodynamics 
in Section 5.3. 


Problems for Chapter 4 


1. Consider the fermion Lagrangian in Eq. (4.13).
(i) Show that invariance under the local transformation (4.2) requires that the covariant
derivative satisfies

$$D'_\mu\Psi' = U D_\mu\Psi.$$

(ii) Show that this result, together with the definition of $D_\mu$, implies the transformation
property for $B^\mu$ given in (4.6).

(iii) Show that the Hermitian quantity $F_{\mu\nu} = -i[D_\mu, D_\nu]$ is the field tensor whose
transformation under local transformations is

$$F'_{\mu\nu} = U F_{\mu\nu} U^{-1}.$$

It is now easy to build an invariant term given by

$$\mathcal L_{YM} = -\frac{1}{4}\,(F_{\mu\nu}^i)^2.$$

2. Show that, under local transformations, the field-strength tensor $F_{\mu\nu}^i$ transforms as a
vector on the index $i$. The result holds including terms linear in $\epsilon_i(x)$, i.e.

$$\mathbf F'_{\mu\nu} = \mathbf F_{\mu\nu} - \boldsymbol\epsilon \times \mathbf F_{\mu\nu} + O(\epsilon^2).$$

You may need the Jacobi identity

$$f_{ABm}f_{mCa} + f_{BCm}f_{mAa} + f_{CAm}f_{mBa} = 0,$$

which follows from

$$\left[\frac{\lambda_A}{2}, \frac{\lambda_B}{2}\right] = i f_{ABm}\,\frac{\lambda_m}{2} \quad\text{and}\quad \mathrm{Tr}(\lambda_A\lambda_B) = 2\delta_{AB}.$$
5
Spontaneous breaking of symmetries


If my view is correct, the universe may have a kind of domain structure. 
In one part of the universe you may have one preferred direction of the 
axis; in another part the direction of the axis may be different. 

(Y. Nambu) 


In the gauge theory of the previous chapter, all gauge bosons and fermions are 
massless. In the real world the only massless vector particle is the photon. Evidently 
we must devise a procedure for giving masses to gauge bosons and other particles. 
During the past few decades, substantial progress has been made in understanding 
the connection between particle masses and symmetries. In theories with global 
symmetries, it is possible for the states to have the same symmetry as the operators 
of the theory, as is, for instance, the case with strong isospin. This, however, is not the 
only mode in which a symmetry manifests itself. In field theories the symmetry can 
be broken by giving a non-vanishing vacuum expectation value to some field, i.e. 


$$\langle\Omega|\phi|\Omega\rangle \neq 0.$$


We say now that the symmetry is spontaneously broken.¹ In this case the operators
of the theory exhibit the symmetry, but the physical states do not. In other words, 
for a symmetry which is spontaneously broken, remnants of the symmetry occur 
explicitly in the commutation relations of the operators, but are realized in the 
particle spectrum in a subtle way. In this chapter we study two such cases: the 
Goldstone mode and the Higgs phenomenon. 

In many cases a symmetry does not allow the introduction of a mass term. 
The breaking of the symmetry generates a mass term. This is demonstrated in 
Section 5.1, where the Lagrangian is invariant under a discrete symmetry. The 


1 The word spontaneous is used to communicate the idea that the phenomenon happens without any evident 
external cause, for example spontaneous combustion, spontaneous emission, .. . 
selection of the vacuum state breaks the symmetry and at the same time generates
a mass. 

For theories with continuous global symmetries the situation is different. The 
selection of a non-trivial vacuum generates masses, but at least one of the scalar 
particles must remain massless. This is the Goldstone phenomenon described in 
Section 5.2. Finally, in gauge theories, the particles that would become Goldstone 
mesons are eliminated by a gauge transformation and produce masses for gauge 
bosons (the Higgs mechanism). 


5.1 Spontaneous breaking of global symmetries: discrete symmetry 


Before we describe the general case, it is instructive to discuss a few simple examples
in which the main ideas are transparent. Consider a real scalar field $\phi(x)$ and the
classical Lagrange function

$$\mathcal L = \frac{1}{2}\,\frac{\partial\phi}{\partial x^\mu}\frac{\partial\phi}{\partial x_\mu} - U(\phi(x)), \tag{5.1}$$

with $U(\phi)$ a potential depending on $\phi$. We are interested in finding the ground state.
To this end we construct the Hamiltonian

$$H = \frac{1}{2}\left[(\partial_0\phi)^2 + (\nabla\phi)^2\right] + U(\phi). \tag{5.2}$$

The field with lowest energy is a constant field, whose value minimizes the potential
$U(\phi)$. All this is classical. In the quantum theory $\phi(x)$ is an operator with a
conjugate momentum. The field and its conjugate momentum satisfy commutation
relations. The fields operate on the eigenstates of the Hamiltonian. For simple field
theories it is possible to construct the eigenstates explicitly. The state of lowest energy
is the ground state, also called the "vacuum." The word vacuum is somewhat misleading,
because the vacuum state is not empty but, rather, is a complicated superposition
of many particles. The term vacuum is appropriate in free-field theory, where it
corresponds to the state with no particles, but for interacting fields the vacuum is
a complicated state with many particles present. The vacuum and other states for
simple Hamiltonians are constructed explicitly in Problems 1-3 at the end of this
chapter. In this book, by vacuum we mean the lowest energy state, which will be
denoted by $|\Omega\rangle$ or simply $|\ \rangle$.
In the class of theories of Eq. (5.1) we discuss two cases: 


\[ U(\phi) = \frac{\lambda}{4!}\,\phi^{4} \pm \frac{\mu^{2}}{2}\,\phi^{2}. \tag{5.3} \]


Figure 5.1. The Higgs potential $U(\phi)$.


(I) Case 1. We select

\[ U(\phi) = \frac{\lambda}{4!}\,\phi^{4} + \frac{\mu^{2}}{2}\,\phi^{2}. \tag{5.4} \]

This is the familiar theory for a field $\phi$ with mass $\mu$ and an interaction term $(\lambda/4!)\phi^{4}$.
The Feynman rules and other properties of this theory occur in many textbooks. The
symmetry is explicit, with the Lagrangian being invariant under the transformation

\[ \phi \to -\phi. \]

All solutions are also invariant under this transformation.
(II) Case 2. Now select

\[ U(\phi) = \frac{\lambda}{4!}\,\phi^{4} - \frac{\mu^{2}}{2}\,\phi^{2}. \tag{5.5} \]

In this case there is no mass term and $U(\phi)$ must be considered as a potential. The
shape and the minima of the potential at

\[ \phi = \pm\sqrt{\frac{6\mu^{2}}{\lambda}} = \pm v \]

are shown in Fig. 5.1. We can select one of the minima as the ground state and study small
oscillations around the minimum. This choice of the ground state breaks the symmetry,
since the vacuum is no longer symmetric under the transformation $\phi \to -\phi$. We look
for a solution in the neighborhood of $v$, and make the substitution

\[ \phi = v + \phi' \quad\text{with}\quad v = \langle\Omega|\phi|\Omega\rangle. \tag{5.6} \]

This describes small oscillations. In terms of the new field,

\[ U(\phi') = \frac{1}{2}\,\frac{\lambda v^{2}}{3}\,\phi'^{2} + \frac{\lambda v}{3!}\,\phi'^{3} + \frac{\lambda}{4!}\,\phi'^{4} - \frac{\lambda v^{4}}{4!}. \]

We note that the term linear in $\phi'$ does not appear, but instead the new field acquired
the mass $\sqrt{\lambda v^{2}/3}$. Another result of the shift is the appearance of a cubic self-coupling,
which spoils the symmetry of the original Lagrangian. In this case the original symmetry
is not present in the solution that we have chosen.
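The statements made for Case 2 can be checked numerically. The following is a minimal sketch, assuming the potential of Eq. (5.5); the numerical values of `lam` and `mu2` and the helper names are illustrative choices that do not come from the text.

```python
import math

def U(phi, lam, mu2):
    """Case 2 potential, Eq. (5.5): U = (lam/4!) phi^4 - (mu2/2) phi^2."""
    return lam / 24.0 * phi**4 - mu2 / 2.0 * phi**2

def dU(phi, lam, mu2):
    """First derivative of U with respect to phi."""
    return lam / 6.0 * phi**3 - mu2 * phi

def d2U(phi, lam, mu2):
    """Second derivative of U; at a minimum it is the squared mass."""
    return lam / 2.0 * phi**2 - mu2

lam, mu2 = 0.6, 1.5                  # illustrative values (assumption)
v = math.sqrt(6.0 * mu2 / lam)       # location of the minimum
slope_at_v = dU(v, lam, mu2)         # should vanish: no term linear in phi'
mass_sq = d2U(v, lam, mu2)           # should equal lam * v**2 / 3
curvature_at_origin = d2U(0.0, lam, mu2)  # negative: phi = 0 is a maximum
```

In this sketch the slope at $\phi = v$ vanishes, the curvature there equals $\lambda v^{2}/3 = 2\mu^{2}$, and the curvature at the symmetric point $\phi = 0$ is negative.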
Figure 5.2. The Higgs potential $U(\phi_1, \phi_2)$.


This simple example demonstrates explicitly the breaking of the symmetry. It is
based on a discrete symmetry, i.e. the reflection of the potential. New phenomena
occur when the Lagrangian possesses either a continuous global symmetry or a 
local symmetry. In the following we discuss both cases. 


5.2 Continuous global symmetries 


The SO(2) model Next we consider a theory based on a continuous symmetry
and then we break it spontaneously. Let us consider a theory with two real scalar
fields, $\phi_1(x)$ and $\phi_2(x)$, and with the potential

\[ U(\phi) = \frac{\lambda}{4}\left(\phi_1^{2} + \phi_2^{2} - v^{2}\right)^{2}; \tag{5.7} \]

$\phi_1$ and $\phi_2$ are massless. This theory is invariant under rotation of $\phi_1$ and $\phi_2$, i.e.
invariant under the group SO(2). The rotations are described by the angle $\theta$,

\[ \begin{pmatrix}\phi_1'\\ \phi_2'\end{pmatrix} = \begin{pmatrix}\cos\theta & \sin\theta\\ -\sin\theta & \cos\theta\end{pmatrix}\begin{pmatrix}\phi_1\\ \phi_2\end{pmatrix}. \tag{5.8} \]

The potential is shown in Fig. 5.2 and has the shape of a Mexican hat. The minima
of the potential lie on the circle

\[ \phi_1^{2} + \phi_2^{2} = v^{2}. \tag{5.9} \]

We show in Fig. 5.3 the locus of minima at the bottom of the hat. The lowest energy
state is any vector $\phi$ in the $\phi_1$–$\phi_2$ plane which ends at the circumference C.

We consider next the quantum-mechanical case and select a minimum in a specific
direction. Without loss of generality, we select a coordinate system with the
$\phi_1$-axis parallel to the vacuum state; then

\[ \langle\phi_1\rangle = v \quad\text{and}\quad \langle\phi_2\rangle = 0. \]

Figure 5.3. The locus of the minima of the Higgs potential.

Next we shift the fields,

\[ \phi_1 = \phi_1' + v, \qquad \phi_2 = \phi_2', \tag{5.10} \]

and find

\[ U(\phi') = \frac{\lambda}{4}\left(\phi_1'^{2} + \phi_2'^{2} + 2v\phi_1'\right)^{2}. \tag{5.11} \]


On expanding this, we see that the $\phi_1'$ field has mass, but the $\phi_2'$ field is massless.
There again appear cubic terms in the fields, which break the original SO(2)
symmetry. 
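Carrying out the expansion explicitly makes the mass spectrum visible. The sketch below assumes the overall normalization $\lambda/4$ for the potential of Eq. (5.11) and canonical kinetic terms $\frac{1}{2}(\partial_\mu\phi_i')^{2}$, neither of which is spelled out at this point in the text:

```latex
\begin{align*}
U(\phi') &= \frac{\lambda}{4}\left(\phi_1'^{2} + \phi_2'^{2} + 2v\phi_1'\right)^{2} \\
         &= \lambda v^{2}\,\phi_1'^{2}
            + \lambda v\,\phi_1'\left(\phi_1'^{2} + \phi_2'^{2}\right)
            + \frac{\lambda}{4}\left(\phi_1'^{2} + \phi_2'^{2}\right)^{2}, \\
m_{\phi_1'}^{2} &= 2\lambda v^{2}, \qquad m_{\phi_2'}^{2} = 0 .
\end{align*}
```

The only quadratic term involves $\phi_1'$, so $\phi_2'$ has no mass term, and the cubic term singles out the $\phi_1'$ direction.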

This is a transparent example of a more general phenomenon and it is worthwhile 
to elaborate on the general case. One starts with a Lagrangian invariant under global 
transformations of a group G. The minima of the potential have the same symmetry. 
Then we break the symmetry by selecting one of the minima to be the vacuum, 
which is invariant under the subgroup H of G. In this analysis there are generators
$\{g_i\}$ of G that do not belong to H. They are broken by the selection of the vacuum.
To each broken generator $g_i$ there corresponds a massless field. These fields are
called the Goldstone bosons. The Goldstone bosons transform under G like the
coset or factor space $K = G/H$. We demonstrate this general phenomenon
with several examples. 


Example 1 In the SO(2) model that we discussed, there is one generator,

\[ I(\theta) = e^{i\theta\tau_2} = \begin{pmatrix}\cos\theta & \sin\theta\\ -\sin\theta & \cos\theta\end{pmatrix}, \tag{5.12} \]

with $\tau_2$ the second Pauli matrix. After breaking of the symmetry, the potential term in (5.11) is not invariant under
the transformation

\[ \begin{pmatrix}\tilde\phi_1\\ \tilde\phi_2\end{pmatrix} = I(\theta)\begin{pmatrix}\phi_1'\\ \phi_2'\end{pmatrix}. \tag{5.13} \]

The generator is broken and there exists one massless particle, $\phi_2'$. When we change
the orientation of the vacuum, there is still a massless particle, which is a linear
superposition of $\phi_1$ and $\phi_2$.


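Example 2 We consider the model with three real fields $(\phi_1, \phi_2, \phi_3)$ invariant
under the group SO(3), that is, invariant under rotations in a three-dimensional
space. We represent the state by a column matrix

\[ \phi = \begin{pmatrix}\phi_1\\ \phi_2\\ \phi_3\end{pmatrix} \tag{5.14} \]

and the Lagrangian by

\[ \mathcal{L} = \frac{1}{2}\,\partial_\mu\phi^{T}\,\partial^{\mu}\phi - U(\phi^{T}\phi), \tag{5.15} \]

with

\[ U(\phi^{T}\phi) = \frac{\lambda}{4}\left(\phi^{T}\phi - v^{2}\right)^{2}. \tag{5.16} \]

We next break the symmetry by giving $\phi_3$ a vacuum expectation value $\langle\phi_3\rangle = v \neq 0$.
After shifting of the fields we obtain the potential

\[ U(\phi') = \frac{\lambda}{4}\left(\phi_1^{2} + \phi_2^{2} + \phi_3'^{2} + 2v\phi_3'\right)^{2}. \tag{5.17} \]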


This expression is still invariant under rotations around the 3-axis, but $U(\phi')$ and
the Lagrangian change when we rotate around the first and second axes. We can
represent a general rotation through the Euler angles $(\alpha, \beta, \gamma)$, which consists of
the following three successive rotations:


(i) a rotation through an angle $\alpha$ about the 3-axis with the transformation matrix

\[ R(0, 0, \alpha) = \begin{pmatrix}\cos\alpha & \sin\alpha & 0\\ -\sin\alpha & \cos\alpha & 0\\ 0 & 0 & 1\end{pmatrix}, \tag{5.18} \]

(ii) a rotation through $\beta$ about the 2-axis with $R(0, \beta, 0)$, and
(iii) a rotation through $\gamma$ about the 1-axis with $R(\gamma, 0, 0)$.


The product of the three matrices gives the complete rotation

\[ R(\alpha, \beta, \gamma) = R(\gamma, 0, 0)\,R(0, \beta, 0)\,R(0, 0, \alpha). \tag{5.19} \]

The model is invariant under the rotations $R(0, 0, \alpha)$, but the $R(\gamma, 0, 0)$ and
$R(0, \beta, 0)$ generators are broken. To the last two generators there correspond two
massless particles, as follows from the form of the potential in (5.17).
Example 3 The last example is a scalar theory invariant under global SU(2). For
the field we consider a complex scalar doublet

\[ \phi(x) = \begin{pmatrix}\phi^{+}\\ \phi^{0}\end{pmatrix} = \begin{pmatrix}\phi_1 + i\phi_2\\ \phi_3 + i\phi_4\end{pmatrix}. \tag{5.20} \]

Each of the fields has a real and an imaginary part. The Lagrangian is given by

\[ \mathcal{L} = \frac{1}{2}\,\partial_\mu\phi^{\dagger}\,\partial^{\mu}\phi - \lambda\left(\phi^{\dagger}\phi - v^{2}\right)^{2}. \tag{5.21} \]

Again we break the symmetry by giving a vacuum expectation value to the real part
of $\phi^{0}$,

\[ \langle\mathrm{Re}\,\phi^{0}\rangle = \langle\phi_3\rangle = v. \tag{5.22} \]

When we shift the field $\phi^{0}$ as before, the potential in terms of the components $\phi^{+}$
and $\phi^{0}$ becomes

\[ U(\phi) = \lambda\left[\,|\phi^{+}|^{2} + |\phi^{0}|^{2} + 2v\,\mathrm{Re}\,\phi^{0}\right]^{2}. \tag{5.23} \]

The new potential is not invariant under transformations of SU(2), with the two
fields in $\phi^{+}(x)$ as well as $\mathrm{Im}\,\phi^{0}$ remaining massless, as is verified by expanding
Eq. (5.23).
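The expansion of Eq. (5.23) can be sketched as follows. It assumes the component decomposition $\phi^{+} = \phi_1 + i\phi_2$, $\phi^{0} = \phi_3 + i\phi_4$ (with $\phi_3$ now denoting the shifted field) and the kinetic normalization $\frac{1}{2}\partial_\mu\phi^{\dagger}\partial^{\mu}\phi$ of Eq. (5.21); the shorthand $S$ is introduced only for this illustration:

```latex
\begin{align*}
S &\equiv \phi_1^{2} + \phi_2^{2} + \phi_3^{2} + \phi_4^{2}, \\
U &= \lambda\left[S + 2v\phi_3\right]^{2}
   = 4\lambda v^{2}\,\phi_3^{2} + 4\lambda v\,\phi_3\,S + \lambda S^{2}, \\
m_{\phi_3}^{2} &= 8\lambda v^{2}, \qquad
m_{\phi_1}^{2} = m_{\phi_2}^{2} = m_{\phi_4}^{2} = 0 .
\end{align*}
```

Only $\phi_3 = \mathrm{Re}\,\phi^{0}$ appears quadratically, which leaves three massless fields.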

To sum up, we found that in field theories with continuous global symmetries the 
breaking of the symmetry requires the existence of scalar particles of zero mass. 
In fact, to every broken generator, there corresponds a massless particle. These 
are representative examples of Goldstone’s theorem, which follows from general 
properties of field theory. 


Goldstone’s theorem If there is a continuous global symmetry transformation 
under which the Lagrangian is invariant, then either the vacuum state is invariant 
under the transformation, or there must exist spinless particles of zero mass. 

We demonstrate the content of the theorem by studying the symmetry properties 
of a general potential. 


(i) We assume that the potential $V(\phi_i)$ contains a set of real fields that transform according
to a representation $T^{a}$ of the group G,

\[ \phi_i'(x) = \phi_i(x) + i\varepsilon^{a}\,T^{a}_{ij}\,\phi_j(x). \tag{5.24} \]

When the fields belong to the adjoint representation, the number of fields equals the
number of generators.
(ii) We assume that the potential is invariant under the group G. Then

\[ \delta V = \frac{\partial V}{\partial\phi_i}\,\delta\phi_i = i\,\frac{\partial V}{\partial\phi_i}\,\varepsilon^{a}\,T^{a}_{ij}\,\phi_j(x) = 0. \tag{5.25} \]

Since the $\varepsilon^{a}$ are arbitrary and continuous variables, it follows that

\[ \frac{\partial V}{\partial\phi_i}\,T^{a}_{ij}\,\phi_j(x) = 0. \tag{5.26} \]

(iii) At the minimum of the potential

\[ \left.\frac{\partial V}{\partial\phi_i}\right|_{\phi_i = v_i} = 0 \tag{5.27} \]

for each $\phi_i$. Differentiating Eq. (5.26) again gives

\[ \frac{\partial^{2}V}{\partial\phi_k\,\partial\phi_i}\,T^{a}_{ij}\,\phi_j + \frac{\partial V}{\partial\phi_i}\,T^{a}_{ik} = 0. \tag{5.28} \]

(iv) At the minimum of the potential, the second term vanishes and

\[ \left.\frac{\partial^{2}V}{\partial\phi_k\,\partial\phi_i}\right|_{\phi = v} T^{a}_{ij}\,v_j = 0. \tag{5.29} \]

The mass matrix is

\[ \left(M^{2}\right)_{ki} = \left.\frac{\partial^{2}V}{\partial\phi_k\,\partial\phi_i}\right|_{\phi = v}. \tag{5.30} \]

Equation (5.29) is an eigenvalue equation with $T^{a}_{ij}v_j$ being the eigenvectors. There
are two important possibilities now. The first is

\[ (\alpha)\qquad T^{a}_{ij}\,v_j = 0, \tag{5.31} \]

which means that the generator $T^{a}$ annihilates the vacuum. The states corresponding
to the generators $T^{a}$ have the symmetry of the group. In the second possibility there
are generators $T^{b}$ for which

\[ (\beta)\qquad T^{b}_{ij}\,v_j \neq 0, \tag{5.32} \]

in which case the symmetry generated by the $T^{b}$ is not a symmetry of the vacuum. In
this case $T^{b}_{ij}v_j$ is an eigenvector with eigenvalue zero, i.e. the states $T^{b}_{ij}v_j$ have zero
mass. In Lanei theories this is a proof of the theorem, which at the same time 
demonstrates which particles remain massless. 
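The content of Eq. (5.29) can be illustrated numerically for the SO(3) model of Example 2. The sketch below is an illustration under stated assumptions: the potential is taken as $U = \frac{\lambda}{4}(\phi^{T}\phi - v^{2})^{2}$, the generators are taken in the vector representation with $(iT^{a})_{ij} = \varepsilon_{aij}$ (so that everything stays real), and all function names and numerical values are hypothetical choices made for this example.

```python
lam, v = 0.5, 2.0                 # illustrative coupling and vacuum value (assumptions)
vacuum = [0.0, 0.0, v]            # vacuum chosen along the 3-axis

def levi_civita(i, j, k):
    """Totally antisymmetric symbol for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2

def mass_matrix(phi):
    """Second derivatives of U = (lam/4) (phi.phi - v^2)^2 at the point phi."""
    r2 = sum(x * x for x in phi)
    return [[lam * ((r2 - v * v) * (i == j) + 2.0 * phi[i] * phi[j])
             for j in range(3)] for i in range(3)]

def matvec(m, x):
    return [sum(m[i][j] * x[j] for j in range(3)) for i in range(3)]

M2 = mass_matrix(vacuum)
broken = []       # generators that do not annihilate the vacuum, case (beta)
zero_modes = []   # M^2 applied to T^a v for each broken generator
for a in range(3):
    generator = [[levi_civita(a, i, j) for j in range(3)] for i in range(3)]
    tv = matvec(generator, vacuum)
    if any(abs(x) > 1e-12 for x in tv):
        broken.append(a)
        zero_modes.append(matvec(M2, tv))
```

With the vacuum along the 3-axis, the rotations about the first and second axes (indices 0 and 1 here) are broken, the corresponding vectors $T^{a}v$ are annihilated by the mass matrix, and the remaining eigenvalue is $2\lambda v^{2}$.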


Physical examples of the phenomenon occur in non-relativistic many-body systems.
The Heisenberg ferromagnet is an example that consists of an infinite array of
spin-$\frac{1}{2}$ magnetic dipoles. The Hamiltonian is rotationally invariant but the magnets
in the ground state are aligned with all spins parallel, thus breaking the rotational
symmetry. In this case the frequency of the spin waves goes to zero with the wavenumber.


In particle physics the phenomenon is relevant to understanding the connection
of symmetries of the hadronic currents. In Section 2.2 we showed that, in the limit
of zero pion mass ($m_\pi = 0$), the axial current is conserved. The conserved vector
and axial currents generate an SU(2)$_L$ × SU(2)$_R$ algebra. Therefore, the symmetry
is present in the currents and leads to many important predictions. It is absent
from the particle spectrum, since there are no parity-degenerate multiplets. Think
of the $(\rho^{+}, \rho^{0}, \rho^{-})$ and $(A^{+}, A^{0}, A^{-})$ isospin multiplets. The $\rho$s transform into
each other with SU(2)$_{\rm vector}$ and the $A$s with SU(2)$_{\rm axial}$, but there is no connection
between the two multiplets through SU(2)$_L$ × SU(2)$_R$ transformations. In other
words, the operators are SU(2)$_L$ × SU(2)$_R$-symmetric but the particle states are not
($m_\rho \neq m_A$). This physical situation can be understood in SU(2)$_L$ × SU(2)$_R$ theory
with spontaneously broken symmetry. The SU(2)$_{\rm axial}$ generators are broken by non-zero
vacuum expectation values and must be accompanied by zero-mass Goldstone
bosons. This is in agreement with the fact that the pions have masses much smaller
(nearly zero) than those of all other hadrons. Strictly speaking, the pions should be
massless, but corrections to the potential or radiative corrections can produce small
masses.


5.3 Spontaneous breaking of local symmetries 


A new phenomenon occurs in local gauge theories, which is crucial for constructing 
theories with massive gauge bosons. The simplest example is scalar electrodynam- 
ics, which was introduced in Section 4.2. We consider the Lagrangian of Eq. (4.36) 
with the potential 


\[ V(\phi^{\dagger}\phi) = -\mu^{2}\phi^{\dagger}\phi + \lambda(\phi^{\dagger}\phi)^{2}. \tag{5.33} \]


The theory is invariant under the gauge transformation defined in Eq. (4.39). In 
addition, the Lagrangian is invariant under a global rotation of the scalar field

\[ \phi \to e^{i\alpha}\phi, \tag{5.34} \]

with $\alpha$ independent of space and time. We first demonstrate properties of the theory
under global transformations and then indicate the changes introduced in a gauge 
theory. 

For the vacuum state we select one of the minima of the potential. By a global
rotation we can transform $\langle\phi\rangle$ to a real value. We represent the field and its vacuum
state by

\[ \phi = \frac{1}{\sqrt{2}}\,(\phi_1 + i\phi_2), \qquad \langle\phi_1\rangle = \left(\frac{\mu^{2}}{\lambda}\right)^{1/2} = v \quad\text{and}\quad \langle\phi_2\rangle = 0. \tag{5.35} \]

As in the previous cases, we translate $\phi_1$,

\[ \phi_1(x) = \phi_1'(x) + v, \tag{5.36} \]

and leave $\phi_2$ unchanged. In terms of the new field, the potential becomes

\[ V = -\frac{\mu^{4}}{4\lambda} + \frac{\lambda}{4}\left[\left(\phi_1'^{2} + \phi_2^{2}\right)^{2} + 4v^{2}\phi_1'^{2} + 4v\,\phi_1'\left(\phi_1'^{2} + \phi_2^{2}\right)\right]. \tag{5.37} \]

The $\phi_1'$ field acquired a mass and there are also trilinear interaction terms. In addition
there are changes in the kinetic terms, which we describe in Problem 5.4; important
among them is the property that $A_\mu$ acquires a mass.
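The shifted potential (5.37) can be compared numerically with the original potential (5.33). This is a minimal sketch, assuming $\phi = (\phi_1 + i\phi_2)/\sqrt{2}$ and $v = (\mu^{2}/\lambda)^{1/2}$; the parameter values, sample points, and function names are illustrative and do not come from the text.

```python
import math
import random

def V(phi1, phi2, mu2, lam):
    """Eq. (5.33) with phi^dagger phi = (phi1^2 + phi2^2) / 2."""
    rho2 = 0.5 * (phi1**2 + phi2**2)
    return -mu2 * rho2 + lam * rho2**2

def V_shifted(p1, phi2, mu2, lam):
    """Right-hand side of Eq. (5.37) in terms of the shifted field p1 = phi1 - v."""
    v = math.sqrt(mu2 / lam)
    s = p1**2 + phi2**2
    return -mu2**2 / (4.0 * lam) + lam / 4.0 * (s**2 + 4.0 * v**2 * p1**2 + 4.0 * v * p1 * s)

mu2, lam = 1.3, 0.7                      # illustrative parameters (assumption)
v = math.sqrt(mu2 / lam)
rng = random.Random(0)                   # fixed seed for reproducibility
samples = [(rng.uniform(-2, 2), rng.uniform(-2, 2)) for _ in range(200)]
max_diff = max(abs(V(p1 + v, p2, mu2, lam) - V_shifted(p1, p2, mu2, lam))
               for p1, p2 in samples)
phi1_mass_sq = 2.0 * lam * v**2          # read off from the term lam * v^2 * p1^2
```

The two expressions agree at every sample point up to rounding, and the quadratic term $\lambda v^{2}\phi_1'^{2}$ corresponds to a squared mass $2\lambda v^{2} = 2\mu^{2}$ for $\phi_1'$, while $\phi_2$ has no quadratic term.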

Alternatively, we can study this theory as a gauge theory. We define two real
fields $\theta(x)$ and $\rho(x)$ by the relation

\[ \phi(x) = e^{i\theta(x)}\,\frac{\rho(x) + v}{\sqrt{2}} \tag{5.38} \]

and give $\phi(x)$ a vacuum expectation value.
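Then we observe that the local gauge transformation

\[ \phi'(x) = e^{-i\theta(x)}\,\phi(x) = \frac{\rho(x) + v}{\sqrt{2}}, \qquad A_\mu'(x) = A_\mu(x) + \frac{1}{e}\,\partial_\mu\theta(x) \tag{5.39} \]

eliminates $\theta(x)$ completely. This transformation is unusual, because the field itself
occurs in the gauge transformation; but it is legitimate in all respects. After the gauge
transformation, the first two terms of Eq. (4.36) retain their form, with primed fields
replacing the old ones. The net effect is

\[ \mathcal{L} = -\frac{1}{4}F_{\mu\nu}'F'^{\mu\nu} + (D_\mu'\phi')^{\dagger}(D'^{\mu}\phi') + \frac{\mu^{2}}{2}(\rho + v)^{2} - \frac{\lambda}{4}(\rho + v)^{4}, \tag{5.40} \]

with $D_\mu' = \partial_\mu + ieA_\mu'$. The second term,

\[ (D_\mu'\phi')^{\dagger}(D'^{\mu}\phi') = \frac{1}{2}\,\partial_\mu\rho\,\partial^{\mu}\rho + \frac{1}{2}\,e^{2}A_\mu'A'^{\mu}\left(\rho^{2} + 2v\rho + v^{2}\right), \tag{5.41} \]

generates a mass for the $A_\mu$ field. The Goldstone field disappeared and the vector
field became massive.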
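The mass can be read off by separating the constant part of the last term of Eq. (5.41). The sketch assumes the standard normalization $\frac{1}{2}M_A^{2}A_\mu A^{\mu}$ for a vector-boson mass term; the symbol $M_A$ is introduced only for this illustration, and primes on the fields are dropped:

```latex
\begin{align*}
\frac{1}{2}\,e^{2}A_\mu A^{\mu}\left(\rho^{2} + 2v\rho + v^{2}\right)
  &= \frac{1}{2}\,M_A^{2}\,A_\mu A^{\mu}
   + e^{2}v\,\rho\,A_\mu A^{\mu}
   + \frac{1}{2}\,e^{2}\rho^{2}A_\mu A^{\mu}, \\
M_A &= e\,v .
\end{align*}
```

The remaining two terms are trilinear and quadrilinear couplings of the scalar $\rho$ to the vector field.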

We have described a second important case of spontaneous symmetry breaking. 
We started with a locally invariant theory describing a charged scalar field (two 
degrees of freedom) and a massless gauge field with two polarizations. After spon- 
taneous symmetry breaking there is one real scalar field and a massive gauge field 
with three polarizations. The spontaneous breaking of the symmetry redistributed 
the degrees of freedom: one of the two real fields forming the complex scalar field 
was transformed into the longitudinal polarization of the vector field. This example
illustrates that the spontaneous breaking of local symmetry does not produce
Goldstone mesons, but gives masses to the gauge bosons. It will be used later on
in order to create masses for the intermediate gauge bosons. 

The spontaneous breaking of scalar electrodynamics is physically unrealistic, 
because electric charge is not conserved. This is evident from the presence of the 
Ap term in (5.33). It is a consequence of the fact that we introduced a non-zero 
vacuum expectation value for a charged field, 


\[ \langle\Omega|\phi|\Omega\rangle \neq 0, \tag{5.42} \]


which in itself violates charge conservation. In realistic theories we can preserve 
the conservation laws by giving non-zero expectation values to fields that carry the 
vacuum quantum numbers. 

This phenomenon was introduced in the sixties in order to evade the Goldstone 
theorem and maintain gauge invariance, despite the fact that the vector meson 
acquires a mass. At that time it was thought to be relevant for the strong interactions. 
Later it was extended to non-Abelian gauge theories. It is now used in order to 
break the gauge symmetry of the electroweak theory and produce masses for the 
intermediate gauge bosons. It is referred to as the Higgs mechanism and we describe 
it in Chapter 7. 


Problems for Chapter 5 
1. Consider a one-dimensional harmonic oscillator. Its Hamiltonian

\[ H = \frac{1}{2}\left(p^{2} + \omega^{2}x^{2}\right) \]

can easily be rewritten in terms of the classical variables

\[ a = \sqrt{\frac{1}{2\omega}}\,(\omega x + ip) \quad\text{and}\quad a^{\dagger} = \sqrt{\frac{1}{2\omega}}\,(\omega x - ip). \]

In quantum mechanics $a$ and $a^{\dagger}$ are operators satisfying the commutation relations

\[ [a, a^{\dagger}] = 1 \quad\text{and}\quad [a, a] = [a^{\dagger}, a^{\dagger}] = 0. \]

Compute
(i) the eigenstates of this Hamiltonian,
(ii) the time development of the operators $a$ and $a^{\dagger}$, and
(iii) the matrix elements of $a$ and $a^{\dagger}$ between arbitrary states.


When you have done all this, then you have solved this quantum field theory completely. 
2. The Hamiltonian for an asymmetric oscillator is

\[ H = \frac{1}{2}\left(p^{2} + \omega^{2}x^{2}\right) + kx. \]

Replace again the position and momentum variables with the operators $a$ and $a^{\dagger}$. The
Hamiltonian reads

\[ H = \frac{1}{2}\,\omega\left(a^{\dagger}a + aa^{\dagger}\right) + k\left(a + a^{\dagger}\right). \]

The problem now is to find a unitary transformation such that

\[ U a\,U^{\dagger} = a - k/\omega, \qquad U a^{\dagger}U^{\dagger} = a^{\dagger} - k/\omega. \]

With the help of U it is possible to eliminate the linear term kx and reduce this problem
to the previous one.
3. The vacuum state, $|\Omega\rangle$, is not always the empty state, $|0\rangle$. This is demonstrated with the
Hamiltonian 
Oe 2 i 29 
H= 7a atza ) Taa 
The number operator $N = a^{\dagger}a$ does not commute with the Hamiltonian. Consequently
the eigenstates of $N$ are not eigenstates of $H$.
Find the lowest-energy state of $H$. To this end, construct two operators with the
properties

\[ [H, Q^{\pm}] = \pm Q^{\pm}. \]

These operators raise and lower the energy by one unit. The lowest-energy state is defined
as usual by the condition

\[ Q^{-}|\Omega\rangle = 0. \]

You can represent the vacuum as $\sum_n c_n (a^{\dagger})^{n}|0\rangle$, and then use the above condition to
give an explicit formula for $|\Omega\rangle$. It is possible to write the normalized vacuum state in
closed form. Finally, construct all higher-energy states.

4. Work out the kinetic term for scalar electrodynamics using the new fields of Eq. (5.36).
Show that the field $A_\mu$ acquired a mass and demonstrate the appearance of trilinear
interaction terms.


Construction of the model


The next task is to find a gauge theory that contains the weak and electromagnetic 
currents described in the previous chapters. We consider a gauge model of electrons 
and their neutrinos. At the very beginning we must answer two questions: 


(i) which group should we select for the theory; and 
(ii) to which representation of the group should we assign the fermion fields? 


The currents of the theory must include at least the charged weak current

\[ J_\mu(x) = \bar\nu_e(x)\,\gamma_\mu(1 - \gamma_5)\,e(x), \tag{6.1} \]

its Hermitian adjoint, and the electromagnetic current

\[ J_\mu^{\rm em}(x) = \bar e(x)\,\gamma_\mu\,e(x). \tag{6.2} \]

We need three vector fields with which to couple them. They correspond to the
intermediate gauge bosons, $W^{\pm}$, and the photon. The smallest group is SU(2). This
group, however, is unacceptable because the currents (6.1) and (6.2) do not form
an SU(2) algebra. This becomes evident on considering the charges and studying
their commutation relations. Consider the charges

\[ T^{+} = \frac{1}{2}\int d^{3}x\;\nu^{\dagger}(x)(1 - \gamma_5)\,e(x), \qquad T^{-} = (T^{+})^{\dagger}. \tag{6.3} \]

The commutator is

\[ [T^{+}, T^{-}] = \frac{1}{4}\int d^{3}x\,d^{3}y\;\left[\nu^{\dagger}(x)(1 - \gamma_5)e(x),\; e^{\dagger}(y)(1 - \gamma_5)\nu(y)\right] \]
\[ \qquad\qquad = \frac{1}{2}\int d^{3}x\,d^{3}y\;\left[\nu^{\dagger}(x)(1 - \gamma_5)\nu(y) - e^{\dagger}(y)(1 - \gamma_5)e(x)\right]\delta^{3}(x - y), \]
\[ 2T_3 = [T^{+}, T^{-}], \tag{6.4} \]

which is not the charge operator corresponding to the electromagnetic current.
There are now two alternatives:


(i) introduce new leptons and modify the weak current $J_\mu$ so that we get the right SU(2)
algebra, or

(ii) introduce another gauge boson $W^{3}$ and its corresponding current. In this alternative there
are four gauge bosons, $W^{\pm}$, $Z$, and $\gamma$, and the group must be enlarged to SU(2) × U(1).


Both alternatives were actively studied and it became evident only after the dis- 
covery of neutral currents that Nature prefers the second solution. 

We consider a theory based on the group SU(2) × U(1). We must decide how the
electron and its neutrino transform under SU(2) and U(1), separately. From (6.3)
and (6.4) we see that the charges

\[ T^{+} = \frac{1}{2}\int d^{3}x\,\left[\nu^{\dagger}(1 - \gamma_5)e\right], \tag{6.5} \]
\[ T^{-} = \left[T^{+}\right]^{\dagger}, \tag{6.6} \]
\[ 2T_3 = \frac{1}{2}\int d^{3}x\,\left[\nu^{\dagger}(1 - \gamma_5)\nu - e^{\dagger}(1 - \gamma_5)e\right] \tag{6.7} \]

generate an SU(2) algebra. This means that the left-handed fields

\[ e_L = \frac{1}{2}(1 - \gamma_5)\,e \quad\text{and}\quad \nu_L = \frac{1}{2}(1 - \gamma_5)\,\nu \]

form an SU(2) doublet,

\[ \psi_L = \begin{pmatrix}\nu_L\\ e_L\end{pmatrix}. \tag{6.8} \]

The charges are defined as

\[ T_i = \int d^{3}x\;\psi_L^{\dagger}\left(\frac{\tau_i}{2}\right)\psi_L \tag{6.9} \]

and

\[ Q = -\int d^{3}x\;e^{\dagger}e = -\int d^{3}x\,\left(e_L^{\dagger}e_L + e_R^{\dagger}e_R\right). \tag{6.10} \]

Since we should include Q in the group transformations, Q must be a combination
of $T_3$ and the generator of U(1), denoted by Y:

\[ \frac{Y}{2} = Q - T_3 = -\frac{1}{2}\int d^{3}x\,\left(e_L^{\dagger}e_L + 2e_R^{\dagger}e_R + \nu_L^{\dagger}\nu_L\right). \tag{6.11} \]

This relation involves the difference between the charge Q and the weak isospin $T_3$
and it defines a new quantum number: the weak hypercharge. It is analogous to the
Gell-Mann–Nishijima formula of the strong interactions which was established by
empirical data. With this definition the charge coincides with what we were always
using as charge, but the weak isospin and hypercharge, when extended to quarks, do
not always coincide with the corresponding hadronic quantum numbers. In (6.11)
there appear the left-handed and right-handed leptonic number operators

\[ N_R = \int d^{3}x\;e_R^{\dagger}e_R, \tag{6.12} \]
\[ N_L = \int d^{3}x\,\left(e_L^{\dagger}e_L + \nu_L^{\dagger}\nu_L\right). \tag{6.13} \]

Thus the hypercharge operator in SU(2) is proportional to the unit matrix, which commutes with
the other generators of the group,

\[ [Y, T_i] = 0 \quad\text{for } i = 1, 2, 3. \tag{6.14} \]

From the relation $Y = -(N_L + 2N_R)$ we deduce that

\[ Y = \begin{cases} -1 & \text{for left-handed states},\\ -2 & \text{for right-handed states}. \end{cases} \]

This satisfies the original requirements of selecting a group and the representations
for the fields with

\[ \psi_L = \begin{pmatrix}\nu_L\\ e_L\end{pmatrix} \ \text{an SU(2) doublet}, \qquad \psi_R = e_R \ \text{an SU(2) singlet}. \]


Finally, we give the parts of the Lagrangian describing the fermion and gauge
fields. We denote by $W^{i}_\mu$ the gauge fields of SU(2) and by $B_\mu$ the field of U(1). The
field tensors are written as

\[ F^{i}_{\mu\nu} = \partial_\mu W^{i}_\nu - \partial_\nu W^{i}_\mu + g\,\varepsilon^{ijk}W^{j}_\mu W^{k}_\nu, \tag{6.15} \]
\[ G_{\mu\nu} = \partial_\mu B_\nu - \partial_\nu B_\mu, \tag{6.16} \]

and the Lagrangian for the gauge fields is

\[ \mathcal{L}_G = -\frac{1}{4}F^{i}_{\mu\nu}F^{i,\mu\nu} - \frac{1}{4}G_{\mu\nu}G^{\mu\nu}, \tag{6.17} \]

with a summation understood over repeated indices. The gauge fields at this stage
are massless. The Lagrangian for the leptons is

\[ \mathcal{L}_F = i\bar\psi_R\,\gamma^{\mu}\left(\partial_\mu + ig'\frac{Y}{2}B_\mu\right)\psi_R + i\bar\psi_L\,\gamma^{\mu}\left(\partial_\mu + ig'\frac{Y}{2}B_\mu + ig\frac{\tau^{i}}{2}W^{i}_\mu\right)\psi_L. \tag{6.18} \]

We notice that the leptons are also massless because there are no $\bar\psi_R\psi_L$ and $\bar\psi_L\psi_R$
terms, which indeed are not SU(2) × U(1)-invariant.


The Higgs mechanism in the
Glashow–Salam–Weinberg model


7.1 Masses for gauge bosons 


In order to give masses to the gauge bosons and the fermions, we follow the method 
described in Section 5.3. We introduce a complex scalar doublet 


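\[ \phi = \begin{pmatrix}\phi^{+}\\ \phi^{0}\end{pmatrix}. \tag{7.1} \]

From the relation $Q = T_3 + Y/2$, it follows that $Y = 1$ for $\phi$. Each of the fields
has a real part and an imaginary part, so there are four independent scalar fields.
The Lagrange function for the scalar sector is given by

\[ \mathcal{L}_\phi = (D_\mu\phi)^{\dagger}(D^{\mu}\phi) - V(\phi^{\dagger}\phi), \tag{7.2} \]

with the covariant derivative defined by

\[ D_\mu = \partial_\mu + ig'\frac{Y}{2}B_\mu + ig\frac{\tau^{i}}{2}W^{i}_\mu \tag{7.3} \]

and the potential by

\[ V(\phi^{\dagger}\phi) = -\mu^{2}\phi^{\dagger}\phi + \lambda(\phi^{\dagger}\phi)^{2}. \tag{7.4} \]

We have been gradually enlarging the Lagrangian and so far it consists of three
terms:

\[ \mathcal{L} = \mathcal{L}_G + \mathcal{L}_F + \mathcal{L}_\phi. \tag{7.5} \]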


It contains fermions, vector bosons, and scalar fields and is invariant under gauge
transformations of the group SU(2) × U(1).

Classically, the potential $V(\phi)$ has a locus of minima at

\[ \frac{\partial V}{\partial\phi^{+\dagger}} = -\mu^{2}\phi^{+} + 2\lambda\left(|\phi^{+}|^{2} + |\phi^{0}|^{2}\right)\phi^{+} = 0, \tag{7.6} \]
\[ \frac{\partial V}{\partial\phi^{0\dagger}} = -\mu^{2}\phi^{0} + 2\lambda\left(|\phi^{+}|^{2} + |\phi^{0}|^{2}\right)\phi^{0} = 0; \tag{7.7} \]

that is, at

\[ |\phi^{+}|^{2} + |\phi^{0}|^{2} = \frac{\mu^{2}}{2\lambda}. \tag{7.8} \]

We can choose the ground state (vacuum state) at the minimum of the potential.
Since we wish to conserve charge, the field must carry vacuum quantum numbers:

\[ |\phi^{0}| = \frac{\mu}{\sqrt{2\lambda}} \quad\text{and}\quad \phi^{+} = 0. \tag{7.9} \]

In the quantum theory the symmetry is broken by introducing

\[ \langle\phi\rangle = \begin{pmatrix}0\\ v/\sqrt{2}\end{pmatrix} \quad\text{with}\quad v = \frac{\mu}{\sqrt{\lambda}}. \]

In other words, one of the neutral scalar fields acquires a vacuum expectation value
at the minimum of the potential.

The next step is to try to rewrite the Lagrangian in terms of fields displaced
relative to the minimum of the potential and arrive at a physical interpretation. The
selection of a vacuum expectation value chooses a direction in the potential, thus
breaking the symmetry. We define four scalar fields, $\xi_1$, $\xi_2$, $\xi_3$, and $\eta$, by

\[ \phi = U^{-1}(\xi)\begin{pmatrix}0\\ (v + \eta)/\sqrt{2}\end{pmatrix}, \tag{7.10} \]

where $U^{-1}(\xi)$ is the unitary transformation

\[ U^{-1}(\xi) = \exp\left(\frac{i\,\boldsymbol{\xi}\cdot\boldsymbol{\tau}}{2v}\right). \tag{7.11} \]

This is very similar to the discussion concerning Eqs. (5.38) and (5.39). Again, we
define new fields through a gauge transformation

\[ \phi \to \phi' = U(\xi)\,\phi = \begin{pmatrix}0\\ (v + \eta)/\sqrt{2}\end{pmatrix}, \tag{7.12} \]
\[ \psi_L \to \psi_L' = U(\xi)\,\psi_L \quad\text{and}\quad \psi_R' = \psi_R, \tag{7.13} \]
\[ \frac{\boldsymbol{\tau}}{2}\cdot\mathbf{W}_\mu \to \frac{\boldsymbol{\tau}}{2}\cdot\mathbf{W}_\mu' = \frac{1}{2}\,U(\xi)\,\boldsymbol{\tau}\cdot\mathbf{W}_\mu\,U^{-1}(\xi) + \frac{i}{g}\left[\partial_\mu U(\xi)\right]U^{-1}(\xi). \tag{7.14} \]

This transformation has the form described in Chapter 5, with a new feature:
the fields themselves occur in the transformation. Upon substitution into the
Lagrangian, the terms $\mathcal{L}_F$ and $\mathcal{L}_G$ retain the same form when expressed in terms of
the new fields, but $\mathcal{L}_\phi$ is modified. In fact, as we show next, several of the scalar
fields disappear and the Lagrangian has a new physical interpretation.

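Consider the $\mathcal{L}_\phi$ term and set

\[ \phi' = \begin{pmatrix}0\\ (v + \eta)/\sqrt{2}\end{pmatrix} = \frac{v + \eta}{\sqrt{2}}\,\chi \quad\text{with}\quad \chi = \begin{pmatrix}0\\ 1\end{pmatrix}. \tag{7.15} \]

On substituting for the $\phi$ field in terms of the new field, it appears as if we are
making a gauge transformation. The covariant derivatives become

\[ D_\mu\phi = \left[\partial_\mu\eta + \frac{i}{2}(v + \eta)\left(g'B_\mu + g\,\tau^{i}W^{i}_\mu\right)\right]\frac{\chi}{\sqrt{2}}, \tag{7.16} \]

\[ (D_\mu\phi)^{\dagger}(D^{\mu}\phi) = \frac{1}{2}(\partial_\mu\eta)(\partial^{\mu}\eta) + \frac{1}{8}(v + \eta)^{2}\,\chi^{\dagger}\left(g'B_\mu + g\,\tau^{i}W^{i}_\mu\right)\left(g'B^{\mu} + g\,\tau^{j}W^{j\mu}\right)\chi. \tag{7.17} \]

The cross-term is purely imaginary and does not appear in the product. We study
in detail the structure of the second term,

\[ (g'B_\mu + g\,\boldsymbol{\tau}\cdot\mathbf{W}_\mu)(g'B^{\mu} + g\,\boldsymbol{\tau}\cdot\mathbf{W}^{\mu}) = g'^{2}B_\mu B^{\mu} + g^{2}\,\mathbf{W}_\mu\cdot\mathbf{W}^{\mu} + 2gg'B_\mu\,\boldsymbol{\tau}\cdot\mathbf{W}^{\mu}, \tag{7.18} \]

and between the $\chi$ states

\[ \chi^{\dagger}\cdots\chi = g'^{2}B_\mu B^{\mu} + g^{2}\,W^{i}_\mu W^{i\mu} - 2gg'B_\mu W^{3\mu} \]
\[ \qquad\quad = \left(g'B_\mu - gW^{3}_\mu\right)^{2} + 2g^{2}\,W^{+}_\mu W^{-\mu} \]
\[ \qquad\quad = \left(g^{2} + g'^{2}\right)Z_\mu Z^{\mu} + 2g^{2}\,W^{+}_\mu W^{-\mu}. \tag{7.19} \]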

The evaluation of the term linear in $\tau$ is most easily done using
$\boldsymbol{\tau}\cdot\mathbf{W}_\mu = \sqrt{2}\,(\tau^{+}W^{+}_\mu + \tau^{-}W^{-}_\mu) + \tau_3 W^{3}_\mu$, with $\tau^{\pm} = \frac{1}{2}(\tau_1 \pm i\tau_2)$,
and properties of $\tau^{\pm}\chi$. New fields were also introduced:

\[ W^{\pm}_\mu = \frac{1}{\sqrt{2}}\left(W^{1}_\mu \mp iW^{2}_\mu\right), \qquad Z_\mu = \frac{-gW^{3}_\mu + g'B_\mu}{\sqrt{g^{2} + g'^{2}}}, \quad\text{and}\quad A_\mu = \frac{g'W^{3}_\mu + gB_\mu}{\sqrt{g^{2} + g'^{2}}}. \tag{7.20} \]

We note that the fields $W^{\pm}$ and $Z$ are now massive with

\[ M_W = \frac{1}{2}\,gv \quad\text{and}\quad M_Z = \frac{1}{2}\left(g^{2} + g'^{2}\right)^{1/2}v, \tag{7.21} \]

but the field $A_\mu$ remains massless. The physical correspondence for the fields
is evident. $A_\mu$ represents the photon and the other three the intermediate gauge
bosons of the weak interaction. An interesting property is the disappearance from
the Lagrangian of the $\xi_1$, $\xi_2$, and $\xi_3$ fields. These three degrees of freedom were
transformed into longitudinal states of the massive vector mesons. This form of the
theory, with its clear physical interpretation, is referred to as the unitary gauge.

To sum up, we have constructed a theory with vector, scalar, and spin-$\frac{1}{2}$ particles
based on the symmetry group SU(2) × U(1). The symmetry was broken in the Higgs
mode by introducing a non-zero vacuum expectation value for the neutral field $\phi^{0}$.
Then it was shown that a judicious choice of the gauge eliminates three scalar fields.
In this gauge the physical interpretation is clear, with the states $W^{\pm}$ and $Z^{0}$ being
massive. They also have three longitudinal degrees of freedom.

In the quantum theory, the breaking of the symmetry by Eq. (7.9) implies 
that the vacuum is not the empty state but a complicated superposition of states, 
as demonstrated for the simple Hamiltonian in Problem 3 of Chapter 5. Condi- 
tion (7.9) does not break the symmetry completely, because the charge generator 
annihilates the vacuum and the law for charge conservation is preserved. The other 
three generators are broken and to each of them there corresponds a massive gauge 
boson. Their masses satisfy the relation 


\[ \frac{M_W^{2}}{M_Z^{2}} = \frac{g^{2}}{g^{2} + g'^{2}}. \tag{7.22} \]
They also couple to fermions through charged and neutral currents, which satisfy the 
SU(2) x U(1) algebra. These and other couplings will be studied in the following 
chapters. 
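To get a feeling for the numbers in Eqs. (7.21) and (7.22), one can insert approximate values of the couplings and of the vacuum expectation value. The inputs below ($g \approx 0.653$, $g' \approx 0.357$, $v \approx 246$ GeV) are rough illustrative values that are not given in this chapter; they are assumptions of this sketch, as are the variable names.

```python
import math

# Illustrative inputs (assumptions, not taken from the text)
g = 0.653          # SU(2) coupling
g_prime = 0.357    # U(1) coupling
v = 246.0          # vacuum expectation value in GeV

m_w = 0.5 * g * v                                  # Eq. (7.21)
m_z = 0.5 * math.sqrt(g**2 + g_prime**2) * v       # Eq. (7.21)
ratio_sq = (m_w / m_z) ** 2                        # left-hand side of Eq. (7.22)
expected_ratio_sq = g**2 / (g**2 + g_prime**2)     # right-hand side of Eq. (7.22)
```

With these inputs `m_w` comes out near 80 GeV and `m_z` near 91 GeV, and the ratio of squared masses depends only on the couplings, not on `v`.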

Finally, the simple mass relations (7.21) depend on the fact that the field $\phi$
was a weak isodoublet. They survive even if $\phi$ is replaced by a finite number of
isodoublet fields. They fail, however, when Higgses belonging to other representations
are introduced. Consider, for instance, a theory that contains, in addition to the
doublet, a triplet of Higgs fields,


S=| g], (7.23) 


with $\langle\xi^{0}\rangle = \sigma \neq 0$. Then, by repeating the steps (7.16)–(7.21) and using the SU(2)
matrices for the three-dimensional representation, the reader can verify the new
mass relations


1 1 
My = 580 +07)? and Mz= E + g’) o. (7.24) 


7.2 Masses for leptons 


The standard model based on the group SU(2) x U(1) allows us to make some of 
the simplest choices. It is the simplest group which contains charged, neutral, and 
electromagnetic currents. This is at the expense of introducing two coupling con- 
stants, g and g’, which are related through the Higgs mechanism to other parameters 
of the theory (masses of gauge bosons, structure of neutral currents, . . .). 

It has the simplest multiplet assignment for the fermion multiplets which is 
consistent with parity violation. The left-handed particles are SU(2) doublets and 
the right-handed components singlets, 


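\[ \psi_e = \begin{pmatrix}\nu_e\\ e\end{pmatrix}_L, \quad e_R, \tag{7.25} \]

with the same pattern repeated for the other two families:

\[ \psi_\mu = \begin{pmatrix}\nu_\mu\\ \mu\end{pmatrix}_L, \ \mu_R \quad\text{and}\quad \psi_\tau = \begin{pmatrix}\nu_\tau\\ \tau\end{pmatrix}_L, \ \tau_R. \tag{7.26} \]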


We note that there are no right-handed neutrinos because it was thought that they 
are massless. This attitude changed with the discovery of neutrino oscillations, 
which require small and finite masses. The subject of neutrino masses is covered 
in Chapter 13. 

Masses for leptons are generated through Yukawa couplings. A Yukawa inter- 
action invariant under SU(2) x U(1) is given by 


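\[ \mathcal{L}_Y = g_e\,\bar\psi_e\,\phi\,e_R + \text{h.c.} \tag{7.27} \]

As mentioned earlier, the symmetry is broken by giving a vacuum expectation value
to $\phi^{0}$:

\[ \langle\phi\rangle = \frac{1}{\sqrt{2}}\begin{pmatrix}0\\ v\end{pmatrix}, \tag{7.28} \]

which gives the mass $m_e = (1/\sqrt{2})\,g_e v$. Similarly, masses are generated for the mu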
and tau leptons. The lepton masses remain arbitrary parameters without any relation 
among them. 
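Since the Yukawa couplings are free parameters, the relation $m = g\,v/\sqrt{2}$ can only be used backwards, to infer each coupling from the measured mass. The sketch below does this; the lepton masses and the value $v \approx 246$ GeV are approximate inputs that are not given in this section and are assumptions of the example, as are the names used.

```python
import math

v = 246.0  # vacuum expectation value in GeV (assumption, not from the text)

# Approximate charged-lepton masses in GeV (assumptions for illustration)
masses_gev = {"e": 0.000511, "mu": 0.1057, "tau": 1.777}

# Invert m = g * v / sqrt(2) for each lepton
yukawa = {name: math.sqrt(2.0) * m / v for name, m in masses_gev.items()}

# Consistency check: recompute the masses from the couplings
recomputed = {name: y * v / math.sqrt(2.0) for name, y in yukawa.items()}
```

The electron coupling comes out of order $3\times10^{-6}$ and the tau coupling of order $10^{-2}$; the theory accommodates this spread but does not explain it.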

The theory has the simplest symmetry-breaking mechanism. The Higgs particles 
are in the fundamental representation of SU(2) containing just enough fields to make
$W^{\pm}$ and $Z^{0}$ massive and leave one neutral Higgs as a physical particle. This pattern
of symmetry-breaking provides a consistent way to control the higher-order terms
by absorbing infinities into the masses and couplings of the theory. 't Hooft (1971)
derived the correct Feynman rules in a class of gauges and constructed gauges for
which the theory is manifestly renormalizable. Detailed studies of renormalization
and unitarity followed (Lee and Zinn-Justin, 1972; 't Hooft and Veltman, 1972).
This remarkable success opened the road for many investigations and predictions 
that have been confirmed by experiments. 


Problems for Chapter 7 
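1. Use the matrices

\[ \lambda_1 = \frac{1}{\sqrt{2}}\begin{pmatrix}0 & 1 & 0\\ 1 & 0 & 1\\ 0 & 1 & 0\end{pmatrix}, \quad \lambda_2 = \frac{1}{\sqrt{2}}\begin{pmatrix}0 & -i & 0\\ i & 0 & -i\\ 0 & i & 0\end{pmatrix} \quad\text{and}\quad \lambda_3 = \begin{pmatrix}1 & 0 & 0\\ 0 & 0 & 0\\ 0 & 0 & -1\end{pmatrix} \]

for the J = 1 representation of SU(2) and the Pauli matrices for the J = 1/2 representation
in order to prove Eq. (7.24).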

2. Generally, it is possible to construct SU(2) × U(1) theories with several multiplets of
scalar fields. We denote them by $\phi_i$ and they carry weak isospin $I_i$ and have a neutral
component $I_{3i}$. If each neutral component develops a vacuum expectation value $v_i/\sqrt{2}$,
show that the W and Z masses satisfy

\[ M_W^{2} = \frac{1}{2}\,g^{2}\sum_i\left[I_i(I_i + 1) - I_{3i}^{2}\right]v_i^{2}, \]
\[ M_Z^{2} = \sec^{2}\theta_W\;g^{2}\sum_i I_{3i}^{2}\,v_i^{2}. \]

Find the first values $(I, I_3)$ for which the relation

\[ M_W = M_Z\cos\theta_W \]

is maintained.


The leptonic sector


8.1 Feynman rules 


We gave in Chapter 6 the Lagrange function for the fermions and the gauge bosons. 
In the previous chapter we defined the physical bosons with definite masses. It 
is now a straightforward exercise to rewrite the Lagrange density in terms of 
the physical bosons and read off the Feynman rules. For the rules, it is neces- 
sary to introduce quantized fields in order to keep track of the combinatorics and 
other factors, especially for diagrams with closed loops. The canonical quantiza- 
tion method in terms of Wick’s theorem does not work for non-Abelian gauge 
theories because there are ambiguities that arise from gauge transformations. The 
appropriate discussion at this point is the quantization in the path-integral formal- 
ism. This will be a long digression and will delay us from arriving at physical 
results. We adopt a compromise. We consider the fermionic part of the Lagrange 
function in terms of the physical fields and read off the relevant vertices. The 
interested reader can compare this method with the procedure used in textbooks 
of quantum electrodynamics. In this way we obtain an extensive set of Feynman 
rules for vertices and propagators, in terms of which we discuss many physical 
processes. 

Later on, we repeat this procedure for other parts of the Lagrangian, which in-
clude Higgses and gauge bosons. The rules that we obtain suffice when we calculate
tree diagrams to any order. Difficulties occur when loop diagrams are computed,
beginning with one-loop diagrams. The difficulties are solved by introducing addi-
tional diagrams with scalar particles: the Faddeev-Popov ghosts.

We saw in the previous chapter that the neutral gauge fields mix among them-
selves. It is appropriate to introduce a mixing angle

    \tan\theta_W = \frac{g'}{g}.        (8.1)

The physical fields defined as mass eigenstates are given by

    W^\pm_\mu = \frac{1}{\sqrt{2}} \left( W^1_\mu \mp i W^2_\mu \right),        (8.2)
    A_\mu = \cos\theta_W \, B_\mu + \sin\theta_W \, W^3_\mu,        (8.3)
    Z_\mu = \sin\theta_W \, B_\mu - \cos\theta_W \, W^3_\mu.        (8.4)

As stated earlier, $A_\mu$ represents the photon, while $Z_\mu$ and $W^\pm_\mu$ represent the neutral
and charged intermediate gauge bosons, whose masses satisfy the relation

    \frac{M_W^2}{M_Z^2} = \cos^2\theta_W.        (8.5)


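In the leptonic Lagrangian (6.18) we substitute for $W^i_\mu$ and $B_\mu$ in terms of the fields
$W^\pm_\mu$, $Z_\mu$, and $A_\mu$. Using again the identity

    \boldsymbol{\tau} \cdot \mathbf{W}_\mu = \sqrt{2} \left( \tau^+ W^+_\mu + \tau^- W^-_\mu \right) + \tau_3 W^3_\mu,        (8.6)

we can read off the couplings of the $W^\pm$ bosons to charged currents.
The couplings of $Z_\mu$ and $A_\mu$ to their respective currents follow after some algebra.
The neutral gauge couplings are

    \mathcal{L}_{NC} = -\bar\psi_R \gamma^\mu g' \frac{Y}{2} B_\mu \psi_R
                       - \bar\psi_L \gamma^\mu \left( g' \frac{Y}{2} B_\mu + g \frac{\tau_3}{2} W^3_\mu \right) \psi_L
                     = -g s \left[ \bar\psi_R \gamma^\mu \frac{Y}{2} \psi_R
                       + \bar\psi_L \gamma^\mu \left( \frac{Y}{2} + \frac{\tau_3}{2} \right) \psi_L \right] A_\mu
                       + \frac{g}{c} \left[ -s^2 \left( \bar\psi_R \gamma^\mu \frac{Y}{2} \psi_R + \bar\psi_L \gamma^\mu \frac{Y}{2} \psi_L \right)
                       + c^2\, \bar\psi_L \gamma^\mu \frac{\tau_3}{2} \psi_L \right] Z_\mu,        (8.7)

with (c, s) = (cos $\theta_W$, sin $\theta_W$). The weak hypercharge Y can be replaced according
to (6.11) by

    \bar\psi_R \gamma^\mu \frac{Y}{2} \psi_R = \bar\psi_R Q \gamma^\mu \psi_R,        (8.8)
    \bar\psi_L \gamma^\mu \frac{Y}{2} \psi_L = \bar\psi_L \gamma^\mu \left( Q - \frac{\tau_3}{2} \right) \psi_L,        (8.9)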


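giving finally

    \mathcal{L}_{NC} = -g s \left( \bar\psi_R \gamma^\mu Q \psi_R + \bar\psi_L \gamma^\mu Q \psi_L \right) A_\mu
                       + \frac{g}{c} \left[ \bar\psi_L \gamma^\mu \frac{\tau_3}{2} \psi_L
                       - s^2 \left( \bar\psi_R \gamma^\mu Q \psi_R + \bar\psi_L \gamma^\mu Q \psi_L \right) \right] Z_\mu.        (8.10)

Finally, on substituting for

    \psi_L = \begin{pmatrix} \nu \\ e \end{pmatrix}_L \quad \text{and} \quad \psi_R = e_R,

we obtain

    \mathcal{L} = i \bar e \gamma^\mu \partial_\mu e + i \bar\nu \gamma^\mu \partial_\mu \nu - g s A_\mu \bar e \gamma^\mu e
                  + \frac{g}{2\sqrt{2}} \left[ \bar\nu \gamma^\mu (1 - \gamma_5) e\, W^+_\mu + \bar e \gamma^\mu (1 - \gamma_5) \nu\, W^-_\mu \right]
                  + \frac{g}{4c} \left[ \bar\nu \gamma^\mu (1 - \gamma_5) \nu + \bar e \gamma^\mu \gamma_5 e - (1 - 4 s^2) \bar e \gamma^\mu e \right] Z_\mu.        (8.11)

The third term above is the coupling of a massless vector particle to the electro-
magnetic current of electrons. Its coupling is evidently the electromagnetic charge

    e = g \sin\theta_W.        (8.12)

We can read off the following vertices:

    photon-fermion-fermion:     -i e Q \gamma^\mu
    W-lepton-neutrino:          \frac{i g}{2\sqrt{2}} \gamma^\mu (1 - \gamma_5)
    Z-neutrino-neutrino:        \frac{i g}{4 \cos\theta_W} \gamma^\mu (1 - \gamma_5)
    Z-electron-electron:        -\frac{i g}{4 \cos\theta_W} \gamma^\mu (1 - 4 \sin^2\theta_W - \gamma_5)
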
The propagators for particles are introduced in many textbooks. For the fermions,

    \frac{i}{\not{p} - m + i\varepsilon}.

For vector mesons the propagator depends on the gauge. We will frequently use
the Feynman gauge, for which the three relevant propagators are

    photon:   \frac{-i g_{\mu\nu}}{p^2 + i\varepsilon},
    W:        \frac{-i g_{\mu\nu}}{p^2 - M_W^2 + i\varepsilon},
    Z:        \frac{-i g_{\mu\nu}}{p^2 - M_Z^2 + i\varepsilon}.

In an arbitrary gauge the vector boson propagator depends on the gauge parameter
$\xi$ as follows:

    \Delta_{\mu\nu}(p) = -i\, \frac{g_{\mu\nu} + (\xi - 1)\, p_\mu p_\nu / (p^2 - \xi M^2)}{p^2 - M^2 + i\varepsilon}.

In special gauges we obtain

    \xi = 1:        \Delta_{\mu\nu} = -i\, \frac{g_{\mu\nu}}{p^2 - M^2 + i\varepsilon}                              (Feynman gauge)
    \xi = 0:        \Delta_{\mu\nu} = -i\, \frac{g_{\mu\nu} - p_\mu p_\nu / p^2}{p^2 - M^2 + i\varepsilon}          (Landau gauge)
    \xi = \infty:   \Delta_{\mu\nu} = -i\, \frac{g_{\mu\nu} - p_\mu p_\nu / M^2}{p^2 - M^2 + i\varepsilon}          (unitary gauge)
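The three special gauges can be checked numerically against the general formula. The following is a minimal sketch, not part of the original derivation: it assumes the metric signature (+, -, -, -), an arbitrary off-shell momentum, and an illustrative mass of 80 GeV, and it compares only the tensor numerators, since the common factor $-i/(p^2 - M^2 + i\varepsilon)$ is the same in all cases. The function names are hypothetical.

```python
def minkowski_metric():
    """Metric g_{mu nu}; the signature (+, -, -, -) is an assumption of this sketch."""
    return [[(1.0 if mu == 0 else -1.0) if mu == nu else 0.0 for nu in range(4)]
            for mu in range(4)]


def rxi_numerator(p, mass, xi):
    """g_{mu nu} + (xi - 1) p_mu p_nu / (p^2 - xi M^2) for contravariant p = (E, px, py, pz)."""
    g = minkowski_metric()
    p_low = [sum(g[mu][nu] * p[nu] for nu in range(4)) for mu in range(4)]
    p2 = sum(p[mu] * p_low[mu] for mu in range(4))
    factor = (xi - 1.0) / (p2 - xi * mass**2)
    return [[g[mu][nu] + factor * p_low[mu] * p_low[nu] for nu in range(4)]
            for mu in range(4)]


def max_difference(a, b):
    """Largest absolute difference between two 4 x 4 tensors."""
    return max(abs(a[mu][nu] - b[mu][nu]) for mu in range(4) for nu in range(4))


p = [50.0, 10.0, 20.0, 30.0]   # arbitrary off-shell momentum in GeV
M = 80.0                       # illustrative boson mass in GeV
g = minkowski_metric()
p_low = [p[0], -p[1], -p[2], -p[3]]
p2 = p[0] ** 2 - p[1] ** 2 - p[2] ** 2 - p[3] ** 2

feynman = g
landau = [[g[mu][nu] - p_low[mu] * p_low[nu] / p2 for nu in range(4)] for mu in range(4)]
unitary = [[g[mu][nu] - p_low[mu] * p_low[nu] / M**2 for nu in range(4)] for mu in range(4)]

diff_feynman = max_difference(rxi_numerator(p, M, 1.0), feynman)
diff_landau = max_difference(rxi_numerator(p, M, 0.0), landau)
diff_unitary = max_difference(rxi_numerator(p, M, 1e12), unitary)   # large xi stands in for infinity
print(diff_feynman, diff_landau, diff_unitary)
```

A very large but finite $\xi$ is used for the unitary gauge because the limit cannot be taken exactly in floating point; the residual difference is of relative order $1/\xi$.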
In addition to electromagnetism, the theory describes weak interactions mediated
by charged $W^\pm$ bosons and the neutral boson Z. Charged-current interactions are
mediated by the $W^\pm$ bosons, whose mass satisfies the relation

    M_W = \frac{1}{2} g v.        (8.13)

In low-energy reactions the W mass can be factored out (or integrated out),
giving an effective four-fermion interaction with the coupling

    \frac{G_F}{\sqrt{2}} = \frac{g^2}{8 M_W^2}.        (8.14)

The electroweak theory goes beyond the V-A theory and predicts the existence of
neutral currents mediated by the Z bosons, whose mass

    M_Z = \frac{1}{2} g v\, \frac{1}{\cos\theta_W}        (8.15)

is related to $M_W$ through (8.5). At low energies the neutral-current interaction also
reduces to an effective interaction, whose overall strength is determined by

    \left( \frac{-i g}{4 \cos\theta_W} \right)^2 \frac{-i}{p^2 - M_Z^2} \;\to\; \frac{-i g^2}{16 M_W^2} = \frac{-i G_F}{2\sqrt{2}}.        (8.16)


The Lagrangian (8.11) defines the weak and electromagnetic interactions of elec-
trons and neutrinos. It contains four unknown quantities: g, $\sin^2\theta_W$, and the masses
$M_W$ and $M_Z$, which occur in the propagators of the bosons. In addition there is the
mass of the electron, which is to be taken from experiment. The four quantities are
not all independent, since three of them are related through (8.5). We can use three
experimental quantities to determine them. Electromagnetic measurements determine
the fine-structure constant

    \alpha = \frac{e^2}{4\pi} = \frac{1}{137.036\ldots}.        (8.17)

The muon decay is used to determine $G_F$. Neutral-current measurements, discussed
in this and subsequent chapters, determine

    \sin^2\theta_W = 0.222 \pm 0.010.        (8.18)


From the three low-energy measurements we now determine the other parameters:

    M_W^2 = \frac{\pi\alpha}{\sqrt{2}\, G_F \sin^2\theta_W}, \qquad M_Z^2 = \frac{M_W^2}{\cos^2\theta_W}        (8.19)

and

    v = \frac{1}{(\sqrt{2}\, G_F)^{1/2}} = 246 GeV.        (8.20)
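The tree-level relations (8.19) and (8.20) are easy to evaluate. The sketch below is illustrative only: the numerical value of $G_F$ is an assumed input (it is not quoted in the text above), and the function name `boson_masses` is hypothetical.

```python
import math

ALPHA = 1 / 137.036      # fine-structure constant, Eq. (8.17)
G_F = 1.16638e-5         # Fermi constant in GeV^-2 (assumed input from muon decay)
SIN2_THETA_W = 0.222     # central value of Eq. (8.18)


def boson_masses(alpha, g_f, sin2_theta_w):
    """Return (M_W, M_Z, v) in GeV from the tree-level relations (8.19) and (8.20)."""
    m_w = math.sqrt(math.pi * alpha / (math.sqrt(2) * g_f * sin2_theta_w))
    m_z = m_w / math.sqrt(1 - sin2_theta_w)
    v = 1 / math.sqrt(math.sqrt(2) * g_f)
    return m_w, m_z, v


m_w, m_z, v = boson_masses(ALPHA, G_F, SIN2_THETA_W)
print(f"M_W = {m_w:.1f} GeV, M_Z = {m_z:.1f} GeV, v = {v:.1f} GeV")
```

With these inputs the masses come out near 79 GeV and 90 GeV. They are tree-level estimates and shift noticeably when $\sin^2\theta_W$ is varied within its quoted error.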


8.2 Predictions in the leptonic sector 


We have now a theory that enables us to compute many processes at the tree level. 
In this chapter we compute three leptonic processes. 


Boson decays Among the many predictions of the model, the decays of the gauge
bosons are simple to discuss. We begin with the decay

    W^- \to e^- \bar\nu_e.        (8.21)

The diagram in Fig. 8.1 gives the amplitude

    \mathcal{M} = \frac{g}{2\sqrt{2}}\, \bar u(k_-) \gamma_\mu (1 - \gamma_5) v(k_+)\, \varepsilon^\mu,        (8.22)

with $\varepsilon^\mu$ being the polarization of the intermediate boson.

Figure 8.1. The Feynman diagram for a $W^-$ decay.

For the decay rate we sum over the polarization states with

    \sum_{\mathrm{Pol}} \varepsilon_\mu \varepsilon^*_\nu = -g_{\mu\nu} + \frac{q_\mu q_\nu}{M_W^2}.        (8.23)

The contribution from $q_\mu q_\nu$ is proportional to the masses of the leptons and can
be neglected. We also ignore terms proportional to lepton masses in the trace com-
putation. The mass of the neutrino being small or zero does not cause any dif-
ficulties, because in the spinor normalization $\bar u u = 2m$ they simply do not occur
in the formulas. Had we used another normalization, i.e. $\bar u u = 1$, then we could
give a small mass to the neutrinos and proceed to calculate decay rates and cross
sections, but we will find in the end that the neutrino masses drop out of the
formulas.
The square of the matrix element summed over spins is

    \sum_{\mathrm{Spins,\,Pol}} \mathcal{M}\mathcal{M}^\dagger
        = \frac{g^2}{8}\, \mathrm{Tr}\left[ (\not{k}_- + m_e) \gamma_\mu (1 - \gamma_5) (\not{k}_+ - m_\nu) \gamma_\nu (1 - \gamma_5) \right] (-g^{\mu\nu})
        = 2 g^2\, k_+ \cdot k_-.        (8.24)


The decay rate is given following standard rules:

    \tilde\Gamma = \frac{1}{2 M_W} \int \frac{d^3 k_-}{2 E_- (2\pi)^3} \frac{d^3 k_+}{2 E_+ (2\pi)^3}\;
                   2 g^2\, k_+ \cdot k_-\, (2\pi)^4 \delta^4(p - k_+ - k_-)
                 = \frac{g^2}{16\pi} M_W = \frac{G_F M_W^3}{2\sqrt{2}\,\pi}.        (8.25)

We must still average over the initial polarizations of the gauge bosons and obtain

    \Gamma = \frac{\tilde\Gamma}{(2s + 1)} = \frac{G_F M_W^3}{6\sqrt{2}\,\pi},        (8.26)

which gives 211.3 MeV for $M_W$ = 80 GeV (235.9 MeV for $M_W$ = 83 GeV).

The total width is obtained by adding the additional decays into $\mu\bar\nu_\mu$, $\tau\bar\nu_\tau$, and
quark pairs. We introduce the vertices of the gauge bosons to quarks in the next
chapter, but we mention here that the decay of $W^-$ into a quark pair of definite color
is also given by (8.26). Thus, for three generations of quarks and leptons the total
width is obtained by multiplying the width by 12: (3 lepton families) + [(3 quark
families) x (3 colors)] = 12;

    \Gamma_{\mathrm{total}} = \frac{2 G_F M_W^3}{\sqrt{2}\,\pi}.        (8.27)

In leptonic decays all that can be observed is the charged lepton. In hadronic decays
the W bosons are inferred by reconstructing the hadronic jets, which imitate to some
extent the kinematic characteristics of the original quarks.

In contrast, the leptonic decays of Z bosons into charged leptons are identified
by the invariant mass of the pairs. By comparing the coupling constants we obtain
the partial decay widths

    \Gamma(Z \to \nu\bar\nu) = \frac{G_F M_Z^3}{12\sqrt{2}\,\pi}        (8.28)

and

    \Gamma(Z \to e^+ e^-) = \frac{G_F M_Z^3}{12\sqrt{2}\,\pi} \left( 1 - 4 \sin^2\theta_W + 8 \sin^4\theta_W \right).        (8.29)


For the total width, we need in addition the decay width into quarks (see Problem 1).
Summing again over three generations,

    \Gamma_{\mathrm{total}}(Z) = \frac{G_F M_Z^3}{3\sqrt{2}\,\pi} \left( \frac{21}{4} - 10 \sin^2\theta_W + \frac{40}{3} \sin^4\theta_W \right).        (8.30)

It is worth noting that the total width is sensitive to the total number of quarks and
leptons lighter than $M_Z$ and precise measurements of $\Gamma_{\mathrm{total}}$ could produce exotic
surprises. The width of the Z boson has been determined in the CERN experiments
to be

    \Gamma_Z = (2.490 \pm 0.007) GeV.

The width in turn limits the number of neutrinos to

    N_\nu = 3.09 \pm 0.13,

which is very close to the number of neutrinos allowed by nucleosynthesis
arguments, $N_\nu \sim 3$-4.
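The width formulas of this section can be cross-checked with a short script. This is a sketch under stated assumptions: the value of $G_F$ is an assumed input, fermion masses are neglected, the couplings are taken as $g_V = I_3 - 2Q\sin^2\theta_W$ and $g_A = I_3$ (the normalization of Table 9.1), and the top quark is excluded from the Z decays, which is the counting that reproduces the coefficients of (8.30). Function names are hypothetical.

```python
import math

G_F = 1.16638e-5   # GeV^-2 (assumed input value)
M_Z = 91.188       # GeV, value quoted in Section 8.4
SIN2 = 0.222       # sin^2(theta_W), Eq. (8.18)

# (name, I3, charge Q, colour factor, number of species light enough to be produced)
FERMIONS = [
    ("neutrino", 0.5, 0.0, 1, 3),
    ("charged lepton", -0.5, -1.0, 1, 3),
    ("up-type quark", 0.5, 2.0 / 3.0, 3, 2),      # u and c only; top is too heavy
    ("down-type quark", -0.5, -1.0 / 3.0, 3, 3),
]


def z_partial_width(i3, q, sin2, m_z=M_Z, g_f=G_F):
    """Tree-level width for Z -> f fbar, one species and one colour, masses neglected."""
    g_v = i3 - 2.0 * q * sin2
    g_a = i3
    return g_f * m_z**3 / (6.0 * math.sqrt(2.0) * math.pi) * (g_v**2 + g_a**2)


def z_total_width(sin2):
    """Sum of partial widths over all light fermions."""
    return sum(n * colour * z_partial_width(i3, q, sin2)
               for _, i3, q, colour, n in FERMIONS)


def z_total_width_closed_form(sin2, m_z=M_Z, g_f=G_F):
    """Eq. (8.30)."""
    bracket = 21.0 / 4.0 - 10.0 * sin2 + 40.0 / 3.0 * sin2**2
    return g_f * m_z**3 / (3.0 * math.sqrt(2.0) * math.pi) * bracket


def w_leptonic_width(m_w, g_f=G_F):
    """Eq. (8.26)."""
    return g_f * m_w**3 / (6.0 * math.sqrt(2.0) * math.pi)


gamma_nu = z_partial_width(0.5, 0.0, SIN2)
gamma_e = z_partial_width(-0.5, -1.0, SIN2)
gamma_z = z_total_width(SIN2)
w_ratio = w_leptonic_width(83.0) / w_leptonic_width(80.0)
print(f"Gamma(Z -> nu nubar) = {1e3 * gamma_nu:.1f} MeV")
print(f"Gamma(Z -> e+ e-)    = {1e3 * gamma_e:.1f} MeV")
print(f"Gamma_total(Z)       = {gamma_z:.3f} GeV")
print(f"Gamma_W(83) / Gamma_W(80) = {w_ratio:.4f}")
```

The absolute W width depends on the value adopted for $G_F$, so only the ratio between the two masses is compared here; it scales as $M_W^3$, which is approximately the ratio of the two values quoted after (8.26).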


8.3 Leptonic neutral currents 


A striking piece of evidence for the electroweak theory was the discovery of neutral
currents. The Lagrangian in (8.11) describes both charged- and neutral-current
interactions of neutrinos and electrons. All the couplings depend on the SU(2)
coupling constant g and the Weinberg angle $\theta_W$. At low energies, the overall strength
of the neutral-current interaction is determined by $G_F$ through Eq. (8.16). Thus
all neutral-current interactions must depend on a single parameter, $\sin^2\theta_W$. There
is a large number of neutral-current reactions that have been measured and the
agreement after two and a half decades of research is indeed impressive. This
section describes in detail leptonic neutral-current reactions. The reader will find
this section very useful also for the semileptonic interactions discussed in Chapters
10-12, since many of the formulas can be taken over. We consider first neutrino-
electron scattering. Six reactions of this type are shown in Fig. 8.2.

Reactions (1) and (2) can proceed only through neutral currents. Other reactions
like

    \nu_e(k) + e^-(p) \to \nu_e(k') + e^-(p')        (8.31)

involve both charged- and neutral-current diagrams. For low-energy reactions it is
convenient to write the Feynman amplitude in the form

    \mathcal{M} = \frac{G_F}{\sqrt{2}} \left[ \bar\nu \gamma_\mu (1 - \epsilon\gamma_5) \nu \;\; \bar e \gamma^\mu (g_V - g_A \gamma_5) e \right].        (8.32)

In this form both vertices retain the charge of the lepton. Evidently, not all reactions
are of this form, because several of them include also charged-current reactions.
Charged- and neutral-current reactions have different propagators and in addition
the order of the spinors is different. Charged-current reactions can be transformed
into the charge-retaining form by Fierz's reordering theorem. A special form of
the theorem states that, when at least one of the couplings is $(1 \pm \gamma_5)$, then we can
interchange the first and the third (or second and fourth) spinors. All examples in
Fig. 8.2 involve a neutrino or antineutrino whose vertex is $\gamma_\mu (1 - \epsilon\gamma_5)$ with $\epsilon = 1$
for neutrinos and $\epsilon = -1$ for antineutrinos. As an illustration, consider the reaction
(8.31) to which the two diagrams in Fig. 8.3 contribute. The amplitude is

    \mathcal{M} = \left( \frac{-i g}{4 c} \right)^2 \bar u(k') \gamma_\mu (1 - \gamma_5) u(k)\;
                  \bar u(p') \left[ \gamma^\mu \gamma_5 - (1 - 4 s^2) \gamma^\mu \right] u(p)\; \frac{-i}{q^2 - M_Z^2 + i\varepsilon}
                + \left( \frac{-i g}{2\sqrt{2}} \right)^2 \frac{-i}{(p - k')^2 - M_W^2}\;
                  \bar u(p') \gamma_\mu (1 - \gamma_5) u(k)\; \bar u(k') \gamma^\mu (1 - \gamma_5) u(p)
      \xrightarrow{q^2 \ll M_W^2} -\frac{i G_F}{\sqrt{2}}\, \bar u(k') \gamma_\mu (1 - \gamma_5) u(k)\;
                  \bar u(p') \left[ \left( \tfrac{1}{2} + 2 s^2 \right) \gamma^\mu - \tfrac{1}{2} \gamma^\mu \gamma_5 \right] u(p)
                = -\frac{i G_F}{\sqrt{2}}\, \bar u(k') \gamma_\mu (1 - \gamma_5) u(k)\;
                  \bar u(p') \left( g_V \gamma^\mu - g_A \gamma^\mu \gamma_5 \right) u(p).        (8.33)

Figure 8.2. Diagrams for neutrino-electron scattering. For each reaction the figure
shows the neutral-current and charged-current diagrams that contribute:

    (1) $\nu_\mu e^- \to \nu_\mu e^-$
    (2) $\bar\nu_\mu e^- \to \bar\nu_\mu e^-$
    (3) $\nu_e e^- \to \nu_e e^-$
    (4) $\bar\nu_e e^- \to \bar\nu_e e^-$
    (5) $\nu_\mu e^- \to \nu_e \mu^-$
    (6) $\bar\nu_e e^- \to \bar\nu_\mu \mu^-$

In this way we can write all reactions in the form (8.32). The explicit values for
$\epsilon$, $g_V$, and $g_A$ for five reactions are given in Table 8.1. We can now calculate the cross
section for the amplitude (8.32) and obtain the specific reaction by substituting the
values from Table 8.1.

Table 8.1. Effective couplings for several reactions

                                                   Electroweak theory                 V-A theory
Reaction                                           \epsilon   g_V             g_A      g_V    g_A
$\nu_\mu + e^- \to \nu_\mu + e^-$                  +1     -1/2 + 2s^2     -1/2
$\bar\nu_\mu + e^- \to \bar\nu_\mu + e^-$          -1     -1/2 + 2s^2     -1/2
$\nu_e + e^- \to \nu_e + e^-$                      +1     +1/2 + 2s^2     +1/2     1      1
$\bar\nu_e + e^- \to \bar\nu_e + e^-$              -1     +1/2 + 2s^2     +1/2     1      1
$\nu_\mu + e^- \to \mu^- + \nu_e$                  +1     1               1        1      1

Figure 8.3. Z and W exchange in neutrino-electron scattering.


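We begin with the amplitude in (8.32) and compute the differential cross section.
For simplicity, for the moment we set $\epsilon = 1$ and the mass of the electron to zero
(whenever allowed). At the end we give the complete formula with $\epsilon$ and a small
term proportional to the electron mass. The square of the amplitude summed over
final spins and averaged over initial spins is

    \overline{|\mathcal{M}|^2} = \frac{1}{2} \sum_{\mathrm{Spins}} \mathcal{M}\mathcal{M}^\dagger
        = \frac{G_F^2}{4}\, \mathrm{Tr}\left[ \gamma_\mu (1 - \gamma_5) \not{k} \gamma_\nu (1 - \gamma_5) \not{k}' \right]
          \times \mathrm{Tr}\left[ \gamma^\mu (g_V - g_A \gamma_5)(\not{p} + m) \gamma^\nu (g_V - g_A \gamma_5)(\not{p}' + m) \right].        (8.34)

Averaging over initial spins brings in a factor of 1/2 because the neutrinos are
always left-handed. Evidently the expression factorizes into two tensors,

    \overline{|\mathcal{M}|^2} = \frac{G_F^2}{2}\, L_{\mu\nu}\, \ell^{\mu\nu},        (8.35)

with

    L_{\mu\nu} = \mathrm{Tr}\left[ \gamma_\mu (1 - \gamma_5) \not{k} \gamma_\nu \not{k}' \right]
               = 4 \left( k_\mu k'_\nu + k_\nu k'_\mu - k \cdot k'\, g_{\mu\nu} + i \varepsilon_{\mu\nu\alpha\beta} k^\alpha k'^\beta \right)        (8.36)

and

    \ell^{\mu\nu} = \mathrm{Tr}\left[ (g_V^2 + g_A^2) \gamma^\mu \not{p} \gamma^\nu \not{p}' + 2 g_V g_A \gamma_5 \gamma^\mu \not{p} \gamma^\nu \not{p}' \right],        (8.37)

where we neglected terms proportional to $m^2$. The computation of the $\ell^{\mu\nu}$ tensor
is similar to that of $L_{\mu\nu}$. In contracting the two tensors, we observe that products
with different symmetries in $\mu$ and $\nu$ vanish:

    \overline{|\mathcal{M}|^2} = 16 G_F^2 \left[ (g_V + g_A)^2 (k \cdot p)(k' \cdot p') + (g_V - g_A)^2 (k \cdot p')(k' \cdot p) \right].        (8.38)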
We choose to compute the cross section in the laboratory frame, where the initial
electron is at rest:

    d\sigma = \frac{1}{2 m_e} \frac{1}{2 E_\nu}\, \overline{|\mathcal{M}|^2}\,
              \frac{d^3 k'}{2 E'_\nu (2\pi)^3} \frac{d^3 p'}{2 E'_e (2\pi)^3}\, (2\pi)^4 \delta^4(k + p - k' - p').        (8.39)

We perform the $d^3 p'$ phase-space integration with the help of the $\delta$-function. The
last integration over the scattering angle involves the integral

    \int d^3 k'\, \delta\left[ m_e^2 - (k - k' + p)^2 \right] = \pi \left( \frac{E'_\nu}{E_\nu} \right) dE'_\nu.        (8.40)


The $\delta$-function gives the relation between the scattering angle and the energy
transfer:

    1 - \cos\theta = \frac{(E_\nu - E'_\nu)\, m_e}{E_\nu E'_\nu}.        (8.41)

Since the average value is $\langle E'_\nu \rangle \approx E_\nu/2$, we can estimate the average scattering
angle $\theta \approx 2^\circ / \sqrt{E_\nu}$ with $E_\nu$ measured in GeV. The scattered electron comes out at
very small forward angles and provides a unique signature for the experiments.
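The angle estimate follows directly from (8.41). A minimal sketch, assuming an electron mass of 0.511 MeV (an input not quoted in the text) and $E'_\nu = E_\nu/2$; the function name is hypothetical:

```python
import math

M_E = 0.511e-3   # electron mass in GeV (assumed input)


def scattering_angle_deg(e_nu, e_nu_prime, m_e=M_E):
    """Angle in degrees from Eq. (8.41): 1 - cos(theta) = (E - E') m_e / (E E')."""
    one_minus_cos = (e_nu - e_nu_prime) * m_e / (e_nu * e_nu_prime)
    return math.degrees(math.acos(1.0 - one_minus_cos))


scaled_angles = []
for e_nu in (1.0, 10.0, 100.0):                  # neutrino energies in GeV
    theta = scattering_angle_deg(e_nu, e_nu / 2.0)
    scaled_angles.append(theta * math.sqrt(e_nu))
    print(f"E = {e_nu:6.1f} GeV: theta = {theta:.3f} deg, theta * sqrt(E) = {scaled_angles[-1]:.3f}")
```

The product $\theta\sqrt{E_\nu}$ stays constant at a little under two degrees, in line with the estimate quoted above.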
From (8.38) and (8.40) we obtain the final result

    \frac{d\sigma}{dE'_\nu} = \frac{G_F^2 m_e}{2\pi} \left[ (g_V + g_A)^2 + (g_V - g_A)^2 \left( \frac{E'_\nu}{E_\nu} \right)^2 \right].        (8.42)

Had we used the amplitude (8.32) and retained the mass of the electron, the final
result would have been

    \frac{d\sigma}{dE'_\nu} = \frac{G_F^2 m_e}{2\pi} \left[ (g_V + \epsilon g_A)^2 + (g_V - \epsilon g_A)^2 \left( \frac{E'_\nu}{E_\nu} \right)^2
                              - (g_V^2 - g_A^2) \frac{m_e \nu}{E_\nu^2} \right].        (8.43)

The last term, which is proportional to the electron mass, is small, so it can be
neglected at accelerator energies. The variable $\nu = E_\nu - E'_\nu$ denotes the energy
transfer.
Formula (8.43) is useful in describing all reactions shown in Fig. 8.2 that include
charged-current reactions. For instance the reaction

    \nu_\mu + e^- \to \mu^- + \nu_e        (8.44)

reduces after substitution to

    \frac{d\sigma}{dE'_\nu} = \frac{2 G_F^2}{\pi}\, m_e.        (8.45)

The same result holds for the semileptonic reaction

    \nu_\mu + d \to \mu^- + u,        (8.46)

which we rewrite in terms of the inelasticity $y = (E_\nu - E'_\nu)/E_\nu$ and the square of the
center-of-mass energy, $s = 2 M E_\nu$, as

    \frac{d\sigma^{\nu}}{dy} = \frac{G_F^2}{\pi}\, s.        (8.47)

For both of the reactions

    \bar\nu_\mu + u \to \mu^+ + d,        (8.48)
    \nu_\mu + \bar u \to \mu^- + \bar d        (8.49)

the differential cross section is

    \frac{d\sigma^{\bar\nu}}{dy} = \frac{G_F^2}{\pi}\, s\, (1 - y)^2.        (8.50)

We see that, when both vertices are left-handed or both are right-handed, then $d\sigma/dy$
is independent of y as in (8.47). On the other hand, when one vertex is left-handed
and the other right-handed, then $d\sigma/dy$ is proportional to $(1 - y)^2$ as in (8.50).
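The two y dependences lead to total cross sections that differ by a fixed factor. The sketch below integrates the shapes of (8.47) and (8.50) over $0 \le y \le 1$ with a simple midpoint rule; the helper `integrate` is a hypothetical name, and the common factor $G_F^2 s/\pi$ is dropped.

```python
def integrate(f, a=0.0, b=1.0, n=10_000):
    """Midpoint rule; adequate for the smooth integrands used here."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))


flat = integrate(lambda y: 1.0)                    # shape of Eq. (8.47)
suppressed = integrate(lambda y: (1.0 - y) ** 2)   # shape of Eq. (8.50)
ratio = suppressed / flat
print(f"integrated (1 - y)^2 relative to flat distribution: {ratio:.4f}")
```

The ratio is 1/3, so for equal couplings a process with the $(1 - y)^2$ distribution has one third of the total cross section of the flat one.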

Neutral-current reactions have a mixed y dependence. As an illustrative example,
consider the neutral-current reaction

    \nu_\mu + e^- \to \nu_\mu + e^-,        (8.51)

for which the cross section is

    \frac{d\sigma}{dy} = \frac{G_F^2 m_e E_\nu}{2\pi} \left[ (1 - 2 \sin^2\theta_W)^2 + 4 \sin^4\theta_W (1 - y)^2 \right].        (8.52)

This reaction has been studied in several experiments. When the existence of neutral
currents was still in doubt, a few events of this type were observed in the Gargamelle
experiment. In spite of many attempts, one could not attribute them to any other
origin. This evidence was gradually reinforced by information from semileptonic
neutral-current reactions until their existence was accepted. Today there are exper-
imental results from a few hundred events for reaction (8.51). The average slope
from all the experiments is

    \frac{\sigma}{E_\nu} = (1.6 \pm 0.4) \times 10^{-42} cm^2 GeV^{-1},

yielding a Weinberg angle given by

    \sin^2\theta_W = 0.222 \pm 0.010.
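The quoted slope can be compared with the tree-level prediction obtained by integrating (8.52) over y. This is a sketch only: the values of $G_F$, the electron mass, and the conversion factor $(\hbar c)^2$ are assumed inputs that do not appear in the text, and `slope_numu_e` is a hypothetical name.

```python
import math

G_F = 1.16638e-5           # GeV^-2 (assumed input)
M_E = 0.511e-3             # GeV (assumed input)
GEV2_TO_CM2 = 0.3894e-27   # (hbar c)^2 in GeV^2 cm^2 (assumed conversion factor)


def slope_numu_e(sin2):
    """sigma / E_nu in cm^2 per GeV for nu_mu e -> nu_mu e, Eq. (8.52) integrated over y."""
    bracket = (1.0 - 2.0 * sin2) ** 2 + 4.0 * sin2**2 / 3.0
    return G_F**2 * M_E / (2.0 * math.pi) * bracket * GEV2_TO_CM2


slope = slope_numu_e(0.222)
print(f"sigma / E_nu = {slope:.2e} cm^2 / GeV")
```

With $\sin^2\theta_W = 0.222$ the result is close to the central value of the measured slope, which is how the measurement translates into a value of the Weinberg angle.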


Data for the other neutrino reactions are also available. In the $g_V$ versus $g_A$ plane
each of the above total cross sections limits the physical region to an elliptical band.


8.4 Weak effects in electron—positron annihilation 


An interesting and very important reaction occurs in electron-positron collisions,

    e^+ e^- \to \mu^+ \mu^-,

which can be mediated by the exchange of a photon, as well as by the heavy boson $Z^0$.
At low energies relative to the mass of the Z boson the photon diagram dominates.
On the other hand, at a center-of-mass energy close to the Z boson's mass, the
gauge boson dominates and exhibits a resonance behavior. At intermediate energies
there is an interference term between electromagnetic and weak interactions, which
modifies the angular distribution.

The amplitude for the process has two diagrams producing two amplitudes:

    \mathcal{M} = \mathcal{M}_\gamma + \mathcal{M}_Z;

    \mathcal{M}_\gamma = \frac{e^2}{q^2}\, \bar v(k_+) \gamma_\mu u(k_-)\; \bar v(p_+) \gamma^\mu u(p_-),

    \mathcal{M}_Z = \frac{g^2}{4 \cos^2\theta_W}\, \frac{1}{q^2 - M_Z^2 + i M_Z \Gamma}
                    \times \bar v(k_+) \gamma_\mu (g_V + g_A \gamma_5) u(k_-)\; \bar v(p_+) \gamma^\mu (g'_V + g'_A \gamma_5) u(p_-).        (8.53)


We consider energies high enough that one can ignore the masses of the electron
and muon. We also included the width of the Z particle in the propagator, which
will lead to a cross section with a Breit-Wigner formula. In field theory the width
is generated by summing the decays to all possible final states. For the couplings of
the Z boson to electrons we introduced general coupling constants $g_V$ and $g_A$; their
dependence on the Weinberg angle follows from the Feynman rules. Similarly, $g'_V$
and $g'_A$ are couplings to muons, which are equal to the couplings of electrons.

At intermediate energies, such as $\sqrt{s}$ = 30-50 GeV, the well-known electromag-
netic formula is modified by the presence of the interference term

    \frac{d\sigma}{d\Omega} = \frac{\alpha^2}{4 s} \left[ (1 + \cos^2\theta) \left( 1 + \epsilon(s) g_V^2 \right) + 2 \epsilon(s) g_A^2 \cos\theta \right],        (8.54)

with s the square of the center-of-mass energy and

    \epsilon(s) = \ldots        (8.55)

The new feature is the $\cos\theta$ term, with the consequence that the differential cross
section is not symmetric in the forward-backward direction. The new term arises
from the neutral current but is not parity-violating. It has a new angular dependence
typical of the $\gamma_5$ coupling. We define a forward-backward asymmetry

    A(\theta) = \frac{d\sigma(\theta) - d\sigma(\pi - \theta)}{d\sigma(\theta) + d\sigma(\pi - \theta)}
              = \frac{2 \cos\theta}{1 + \cos^2\theta}\, \frac{\epsilon(s) g_A^2}{1 + \epsilon(s) g_V^2}.        (8.56)

The asymmetry was measured in many experiments and gave values for $g_A = -1/2$
consistent with the standard model.

At higher energies the interference term becomes larger. As the center-of-mass
energy approaches the mass

    M_Z = 91.188 \pm 0.002 GeV/c^2,

the weak term dominates and produces the cross section

    \frac{d\sigma}{d\Omega} = \left( \frac{g^2}{8 c^2} \right)^2 \frac{1}{16\pi^2}\, \frac{s}{(s - M_Z^2)^2 + M_Z^2 \Gamma^2}
                              \left[ (g_V^2 + g_A^2)^2 (1 + \cos^2\theta) + 8 g_V^2 g_A^2 \cos\theta \right],        (8.57)

where c = cos $\theta_W$. The resonance was observed at CERN and was studied carefully
to give precise values for the mass and the width of the gauge boson quoted in this
chapter.

In addition to the muons, electron-positron collisions also produce $q\bar q$ pairs,
which are analyzed with the same formulas. The values for $g'_V$ and $g'_A$ are now
replaced by couplings appropriate for quarks.


Problem for Chapter 8 


1. Compute for each generation the decay width of the Z boson for decays to neutrinos,
charged leptons, and quark pairs separately. Then estimate the total decay width.


Incorporating hadrons


9.1 The mixing matrix 


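The weak interaction for the leptons was introduced into the theory by arranging
the left-handed leptons (chirality -1) in three generations of doublets and the right-
handed charged leptons into three singlets:

    \begin{pmatrix} \nu_e \\ e^- \end{pmatrix}_L, \;
    \begin{pmatrix} \nu_\mu \\ \mu^- \end{pmatrix}_L, \;
    \begin{pmatrix} \nu_\tau \\ \tau^- \end{pmatrix}_L; \quad e^-_R, \; \mu^-_R, \; \tau^-_R.        (9.1)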


The similarities of the interactions of leptons to those of quarks suggest that one
should similarly introduce for the quarks left-handed doublets and right-handed
singlets. The situation for the quarks is different, since all of them are massive. For
this reason each quark field has a right-handed component. The fields are classified
as three doublets,

    q^1_L = \begin{pmatrix} u' \\ d' \end{pmatrix}_L, \quad
    q^2_L = \begin{pmatrix} c' \\ s' \end{pmatrix}_L, \quad \text{and} \quad
    q^3_L = \begin{pmatrix} t' \\ b' \end{pmatrix}_L,        (9.2)

and six right-handed singlets, $u'_R$, $d'_R$, $c'_R$, $s'_R$, $t'_R$, and $b'_R$. The superscripts denote
three generations and the primes indicate that they are gauge quarks. The part of
the Lagrangian which contains the kinetic terms and the couplings of the quarks to
$W^\pm$, $Z^0$, and photons is written as follows:

    \mathcal{L} = \bar q^i_L \gamma^\mu \left[ i\partial_\mu + \frac{g}{2} \boldsymbol{\tau} \cdot \mathbf{W}_\mu + \frac{g'}{2} Y B_\mu \right] q^i_L
                + \bar q^i_R \gamma^\mu \left[ i\partial_\mu + \frac{g'}{2} Y B_\mu \right] q^i_R.        (9.3)

The operator Y denotes the weak hypercharge, which has been defined already,

    Y = 2(Q - I_3).        (9.4)


At this stage it is not clear whether the fields u', d', ... stand for the physical states,
because Eq. (9.3) contains only kinetic and interaction terms. Physical fields are
eigenstates of the mass matrix, which will be introduced below. This is the reason
why the quark fields in Eq. (9.3) have a prime and we referred to them as gauge
eigenstates or gauge quarks.

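Masses for the quarks are generated through quark-Higgs Yukawa couplings. A
Yukawa interaction invariant under SU(2) ⊗ U(1) gauge transformations is easily
constructed:

    \mathcal{L}_{\mathrm{mass}} = h^{(d)}_{ij}\, (\bar u'_i, \bar d'_i)_L \begin{pmatrix} \phi^+ \\ \phi^0 \end{pmatrix} d'_{jR}
                                + h^{(u)}_{ij}\, (\bar u'_i, \bar d'_i)_L \begin{pmatrix} \bar\phi^0 \\ -\phi^- \end{pmatrix} u'_{jR} + \mathrm{h.c.},        (9.5)

where i, j = 1, 2, 3 and the Higgs fields are the same fields as those introduced in
Chapters 5 and 7. The matrices $h^{(u)}_{ij}$ and $h^{(d)}_{ij}$ denote couplings of i and j quarks of
the up and down types, respectively. The symmetry is broken by giving a vacuum
expectation value to $\phi^0$:

    \phi = \begin{pmatrix} \phi^+ \\ \phi^0 \end{pmatrix} \xrightarrow{\mathrm{breaking}} \frac{1}{\sqrt{2}} \begin{pmatrix} 0 \\ v + \eta \end{pmatrix}        (9.6)

and

    \tilde\phi = \begin{pmatrix} \bar\phi^0 \\ -\phi^- \end{pmatrix} \xrightarrow{\mathrm{breaking}} \frac{1}{\sqrt{2}} \begin{pmatrix} v + \eta \\ 0 \end{pmatrix},        (9.7)

with $\eta$ the field fluctuation around the minimum. Spontaneous breaking of the
symmetry generates the mass terms

    \mathcal{L}_{\mathrm{mass}} = \frac{v}{\sqrt{2}} \left( \bar u'_{iL} h^{(u)}_{ij} u'_{jR} + \bar d'_{iL} h^{(d)}_{ij} d'_{jR} \right) + \mathrm{h.c.}        (9.8)

The expressions

    M^{(u)} = \frac{v}{\sqrt{2}} h^{(u)} \quad \text{and} \quad M^{(d)} = \frac{v}{\sqrt{2}} h^{(d)}        (9.9)

occurring above are called the mass matrices. From the way they were intro-
duced, there is no reason for them to be either symmetric or Hermitian. In fact
the Lagrangian in (9.5) is manifestly gauge-invariant and the mass matrices are
to a certain extent arbitrary. The mass matrices are very important because they
determine the masses and the flavor mixing of the quarks.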

The quark fields that have been investigated up to now are, as has already been 
mentioned, non-physical gauge eigenstates. To find the physical or mass eigenstates, 
we must transform the quark-mass matrices into diagonal form. 

Any square matrix can be diagonalized by a bi-unitary transformation. Therefore
it is always possible to find four matrices $U_{L,R}$ and $D_{L,R}$ that diagonalize the mass
matrices,

    M^{(u)}_{\mathrm{diag}} = U_L^\dagger M^{(u)} U_R = \begin{pmatrix} m_u & 0 & 0 \\ 0 & m_c & 0 \\ 0 & 0 & m_t \end{pmatrix},        (9.10)

    M^{(d)}_{\mathrm{diag}} = D_L^\dagger M^{(d)} D_R = \begin{pmatrix} m_d & 0 & 0 \\ 0 & m_s & 0 \\ 0 & 0 & m_b \end{pmatrix}.        (9.11)

The mass eigenstates, to be denoted as unprimed fields, are related to the gauge
eigenstates by the transformations

    u_{Li} = (U_L^\dagger)_{ij}\, u'_{Lj}, \qquad u_{Ri} = (U_R^\dagger)_{ij}\, u'_{Rj},
    d_{Li} = (D_L^\dagger)_{ij}\, d'_{Lj}, \qquad d_{Ri} = (D_R^\dagger)_{ij}\, d'_{Rj}.        (9.12)


In terms of the mass eigenstates the mass term is now diagonal. Therefore we can
substitute the physical fields everywhere in the Lagrangian and deduce the physical
couplings. The neutral couplings expressed in terms of physical quarks retain the
same form as they had with gauge quarks. The charged current after the substitution
becomes

    J^+_\mu = \bar u'_{Li} \gamma_\mu d'_{Li} = \bar u_{Li} \gamma_\mu V_{ij}\, d_{Lj}        (9.13)

and the charged-current interaction is

    \mathcal{L}_{cc} = \frac{g}{\sqrt{2}} \left( J^+_\mu W^{+\mu} + J^-_\mu W^{-\mu} \right),        (9.14)

where $V = U_L^\dagger D_L$ and summation over repeated indices is understood. This matrix
is one of the most important quantities in the standard model, because it contains
information on all possible flavor transitions and CP violation. It is called the flavor-
mixing matrix or the Cabibbo (1963)-Kobayashi-Maskawa (1973) (CKM) matrix.

By construction the flavor matrix is unitary, a property that will be used exten- 
sively in the next section. The mixing matrix is derived directly from the mass 
matrices, which shows that all information about it is included in the mass matri- 
ces. A determination of the mass matrices from experimental data is impossible 
because they contain 36 real parameters (9 complex numbers for each charge sector). 
By contrast, there are only ten quantities that can be determined by experiment: 
six quark masses and four independent mixing parameters. The fact that only four 
parameters of the mixing matrix are relevant can be understood as follows: a unitary 
N x N matrix may be expressed by $N^2$ real parameters. Among them N(N - 1)/2
can be chosen to be the rotation angles of an orthogonal matrix and the remaining
N(N + 1)/2 taken as phase angles. Not all phases, however, are physical. Each
quark field has an arbitrary phase that can be used to eliminate a phase of the CKM
matrix, except for an overall phase. This means that one can arrange the phases of
the quark fields in such a way that they eliminate phases in $V_{ij}$ of Eq. (9.14). An
exception to this rule is an overall phase that is lost when we square matrix elements
in order to produce probabilities. Thus N x N flavor mixing can be parametrized
by N(N - 1)/2 rotation angles and

    N(N + 1)/2 - (2N - 1) = (N - 1)(N - 2)/2



phase angles. For three quark generations (N = 3) there remain three rotation 
angles and one phase, which is responsible for CP violation. So in all there are four 
parameters that describe the mixing matrix. 
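The counting argument can be tabulated for different numbers of generations. A minimal sketch; the function name `mixing_parameters` is hypothetical:

```python
def mixing_parameters(n):
    """Return (rotation angles, physical phases) of an n x n unitary mixing matrix."""
    total = n * n                      # real parameters of a unitary matrix
    angles = n * (n - 1) // 2          # rotation angles of an orthogonal matrix
    phases_before = n * (n + 1) // 2   # remaining parameters are phases
    assert total == angles + phases_before
    removable = 2 * n - 1              # 2n quark-field phases, minus one overall phase
    return angles, phases_before - removable


for n in (2, 3, 4):
    angles, phases = mixing_parameters(n)
    print(f"N = {n}: {angles} angle(s), {phases} phase(s)")
```

For two generations there is one angle and no phase, so a CP-violating phase first becomes possible with three generations.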

The unitarity of the mixing matrix that is required in gauge theories is a conse-
quence of the unitarity of the matrices $U_L$ and $D_L$:

    V_{ik} (V^\dagger)_{kj} = V_{ik} V^*_{jk} = \delta_{ij}.        (9.15)

This relation expresses the orthogonality of rows and columns within the matrix.
As was shown above, the mixing matrix can be parametrized with four quan-
tities, but this does not determine the functional form of the matrix. The first
explicit parametrization was given by Kobayashi and Maskawa. They used Euler-
type angles for three-dimensional rotations in the flavor space and one phase:

    V_{KM} = \begin{pmatrix}
        c_1     & -s_1 c_3                             & -s_1 s_3 \\
        s_1 c_2 & c_1 c_2 c_3 - s_2 s_3 e^{i\delta}    & c_1 c_2 s_3 + s_2 c_3 e^{i\delta} \\
        s_1 s_2 & c_1 s_2 c_3 + c_2 s_3 e^{i\delta}    & c_1 s_2 s_3 - c_2 c_3 e^{i\delta}
    \end{pmatrix},        (9.16)

where the abbreviations $s_i = \sin\theta_i$ and $c_i = \cos\theta_i$ are used. The parameters are
chosen so that, for $\theta_2 = \theta_3 = \delta = 0$, the three-dimensional mixing matrix is reduced
to the corresponding one for just two doublets. The angles $\theta_i$ may without loss of
generality be chosen to lie in the first quadrant, $0 \le \theta_i \le \pi/2$. The phase angle $\delta$
may take any value within the interval $[-\pi, \pi]$.
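The unitarity relation (9.15) can be verified numerically for the parametrization (9.16). The angles and phase below are arbitrary illustrative values, not fitted ones, and the function names are hypothetical.

```python
import cmath
import math


def km_matrix(theta1, theta2, theta3, delta):
    """Kobayashi-Maskawa form of Eq. (9.16)."""
    s1, s2, s3 = (math.sin(t) for t in (theta1, theta2, theta3))
    c1, c2, c3 = (math.cos(t) for t in (theta1, theta2, theta3))
    e = cmath.exp(1j * delta)
    return [
        [c1, -s1 * c3, -s1 * s3],
        [s1 * c2, c1 * c2 * c3 - s2 * s3 * e, c1 * c2 * s3 + s2 * c3 * e],
        [s1 * s2, c1 * s2 * c3 + c2 * s3 * e, c1 * s2 * s3 - c2 * c3 * e],
    ]


def unitarity_defect(v):
    """Largest |(V V^dagger - 1)_ij|, cf. Eq. (9.15)."""
    n = len(v)
    worst = 0.0
    for i in range(n):
        for j in range(n):
            entry = sum(v[i][k] * complex(v[j][k]).conjugate() for k in range(n))
            worst = max(worst, abs(entry - (1.0 if i == j else 0.0)))
    return worst


v_km = km_matrix(0.22, 0.05, 0.01, 1.0)        # illustrative angles and phase
defect = unitarity_defect(v_km)
two_generation = km_matrix(0.22, 0.0, 0.0, 0.0)
print(defect)
print(two_generation[0][2], two_generation[2][0])
```

Setting $\theta_2 = \theta_3 = \delta = 0$ decouples the third generation, as stated in the text: the entries linking it to the first two generations vanish.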

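This parametrization is just one possibility. Another one that is very customary
was given by Maiani. It is quite suitable for investigations of B-meson decays:

    V_M = \begin{pmatrix}
        c_\beta c_\theta & c_\beta s_\theta & s_\beta \\
        -s_\beta s_\gamma c_\theta e^{i\delta'} - s_\theta c_\gamma & c_\gamma c_\theta - s_\beta s_\gamma s_\theta e^{i\delta'} & s_\gamma c_\beta e^{i\delta'} \\
        -s_\beta c_\gamma c_\theta + s_\gamma s_\theta e^{-i\delta'} & -s_\beta s_\theta c_\gamma - s_\gamma c_\theta e^{-i\delta'} & c_\beta c_\gamma
    \end{pmatrix}.        (9.17)
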

The quantity $s_\gamma$ is mainly the coupling for b → c and $s_\beta$ is mainly for b → u.
Ranges for angles and the phase can be chosen as in the Kobayashi-Maskawa
parametrization.

A further parametrization is the one by Wolfenstein. In this case the elements
are expanded in terms of a small parameter $\lambda = \sin\theta$, exploiting the experimental
information about the smallness of the mixing angles. The structure of the matrix
is determined by the unitarity conditions of the mixing matrix:

    V_W = \begin{pmatrix}
        1 - \frac{1}{2}\lambda^2 & \lambda & A\lambda^3 (\rho - i\eta + \frac{1}{2} i\eta\lambda^2) \\
        -\lambda & 1 - \frac{1}{2}\lambda^2 - i\eta A^2 \lambda^4 & A\lambda^2 (1 + i\eta\lambda^2) \\
        A\lambda^3 (1 - \rho - i\eta) & -A\lambda^2 & 1
    \end{pmatrix}.        (9.18)

In contrast to the parametrizations discussed so far, the one by Wolfenstein is only
an approximation, being an expansion in a small parameter. The unitarity condition
is satisfied to a given order in $\lambda$. The real parts of the elements are correct to the
order $\lambda^3$ and the imaginary parts to order $\lambda^5$. The parameters A, $\eta$, and $\rho$ are of
order unity or even smaller:

    \lambda = 0.221 \pm 0.002, \quad A = 1.0 \pm 0.1, \quad \sqrt{\rho^2 + \eta^2} = 0.46 \pm 0.23.        (9.19)

Magnitudes for elements of the mixing matrix are determined directly from ex-
periments. They in turn are translated into values for the rotation angles. We will
review the experiments and the corresponding values. It is much more difficult to
obtain values for $\delta$, since it is related to CP-violating quantities. We shall return to
its determination in Chapters 15 and 16.
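Because the Wolfenstein form is a truncated expansion, it is unitary only up to higher powers of $\lambda$. The sketch below measures the violation for the central values of (9.19); the individual values of $\rho$ and $\eta$ are illustrative choices satisfying $\sqrt{\rho^2 + \eta^2} \approx 0.46$, not measured numbers, and the function names are hypothetical.

```python
def wolfenstein(lam, a, rho, eta):
    """Wolfenstein form of Eq. (9.18)."""
    return [
        [1 - lam**2 / 2, lam, a * lam**3 * (rho - 1j * eta + 1j * eta * lam**2 / 2)],
        [-lam, 1 - lam**2 / 2 - 1j * eta * a**2 * lam**4, a * lam**2 * (1 + 1j * eta * lam**2)],
        [a * lam**3 * (1 - rho - 1j * eta), -a * lam**2, 1.0],
    ]


def unitarity_defect(v):
    """Largest |(V V^dagger - 1)_ij|."""
    n = len(v)
    worst = 0.0
    for i in range(n):
        for j in range(n):
            entry = sum(complex(v[i][k]) * complex(v[j][k]).conjugate() for k in range(n))
            worst = max(worst, abs(entry - (1.0 if i == j else 0.0)))
    return worst


lam, a = 0.221, 1.0
rho, eta = 0.30, 0.35          # illustrative split of sqrt(rho^2 + eta^2) ~ 0.46
defect = unitarity_defect(wolfenstein(lam, a, rho, eta))
print(f"unitarity violation = {defect:.2e}, in units of lambda^4: {defect / lam**4:.2f}")
```

The violation is of order $\lambda^4$, a few parts in a thousand for $\lambda = 0.221$, which quantifies the statement that unitarity holds only to a given order in $\lambda$.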


9.2 Flavor-changing neutral couplings (FCNCs) 


The structure of the mixing matrix has another important consequence. The neutral
couplings of the theory preserve flavor to a large degree of accuracy. The suppression
of flavor-changing neutral couplings is required by many experimental results. For
instance, the decay $K_L \to \mu^+\mu^-$ is highly suppressed. The branching ratio is

    Br(K_L \to \mu^+\mu^-) = (7.2 \pm 0.2) \times 10^{-9}.


The D°D° mixing has not been observed at the 107° level. These properties and 
others are incorporated into the theory by the construction described in Section 9.1. 
In fact, FCNCs are absent at the tree level. The proof of this follows from the
structure of neutral currents. We can begin again with the Lagrangian in Eq. (8.10)
and substitute the $\psi$'s with quark fields. The couplings of the quarks to the Z and $\gamma$
are

    \mathcal{L}_{NC} = -e \sum_{i=1}^{3} \bar q^i Q \gamma^\mu q^i A_\mu
                     + \frac{g}{\cos\theta_W} \sum_{i=1}^{3} \left\{ \bar q^i_L \frac{\tau_3}{2} \gamma^\mu q^i_L
                     - \sin^2\theta_W\, \bar q^i Q \gamma^\mu q^i \right\} Z_\mu.        (9.20)

Table 9.1. Couplings of quarks and leptons to $Z^0$

States             g_V                               g_A
Up quarks          1/2 - (4/3) sin^2 \theta_W        1/2
Down quarks        -1/2 + (2/3) sin^2 \theta_W       -1/2
Neutrinos          1/2                               1/2
Charged leptons    -1/2 + 2 sin^2 \theta_W           -1/2

Figure 9.1. Vertices appearing in box diagrams. 


At first sight the quarks occurring in (9.20) should have a prime, since they are still
gauge quarks, but Eq. (9.20) is diagonal in the quark fields and the unitary matrices
$U_{L,R}$ and $D_{L,R}$ will disappear when the quark fields are replaced by physical states.
Thus the omission of the prime is justified.

Next we introduce a convenient notation and write the neutral-current couplings
in the form

    \mathcal{L}_{NC} = \frac{g}{2 \cos\theta_W}\, \bar q^i \gamma^\mu (g_V - g_A \gamma_5) q^i Z_\mu,        (9.21)

with $q^i$ representing general states of up and down quarks or leptons. The couplings
are given in Table 9.1, where we also include the neutral couplings to neutrinos and
leptons. In this way neutral couplings are diagonal at the tree level.

The suppression introduced so far is not sufficient. Flavor-changing effects now
appear through higher-order corrections that involve charged currents. Generic
higher-order effects are $O(G\alpha)$, which is still too large. However, the
construction introduced so far suppresses FCNCs to the level $O(G\alpha\, m_q^2/M_W^2)$,
where $m_q$ is the mass of the quark in the intermediate state. We can see this by
considering the upper line of the box diagram shown in Fig. 9.1. The mixing-matrix
elements which occur in this line include a mass-independent term,


$$V^*_{ik}V_{ij} = V^\dagger_{ki}V_{ij} = \delta_{kj}, \tag{9.22}$$

and a mass-dependent term

$$V^*_{ik}V_{ij}\,\frac{m_{q_i}^2}{M_W^2} = V^\dagger_{ki}V_{ij}\,\frac{m_{q_i}^2}{M_W^2}. \tag{9.23}$$

The leading term vanishes by virtue of the unitarity of the mixing matrix and
the next term is proportional to the mass of the intermediate quark. This is the
famous Glashow–Iliopoulos–Maiani cancellation scheme (Glashow et al., 1970).
Processes involving quarks in intermediate states, which are light relative to $M_W$,
give a very small contribution. The mechanism has important consequences for
box and penguin diagrams. These diagrams occur for $K^0$–$\bar K^0$ mixing and the
$\varepsilon_K$ parameter.
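
The cancellation can be illustrated numerically for the two-generation mixing matrix of Eq. (9.24). The sketch below sums the products of matrix elements over the intermediate up-type quarks for a $d \to s$ transition; the function names, the mixing angle, and the quark and W masses passed in are illustrative assumptions, and the matrix is taken to be real so that complex conjugation plays no role.

```python
import math


def cabibbo_matrix(theta):
    """Two-generation mixing matrix; rows are (u, c), columns are (d, s)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, s], [-s, c]]


def gim_sums(theta, up_type_masses, m_w):
    """Return (mass-independent sum, mass-dependent sum) over intermediate quarks.

    leading   = sum_i V_id V_is
    mass_term = sum_i V_id V_is * m_i^2 / M_W^2
    """
    v = cabibbo_matrix(theta)
    leading = sum(v[i][0] * v[i][1] for i in range(2))
    mass_term = sum(
        v[i][0] * v[i][1] * (up_type_masses[i] / m_w) ** 2 for i in range(2)
    )
    return leading, mass_term


if __name__ == "__main__":
    # Illustrative inputs (GeV): m_u, m_c and M_W are assumptions, not fitted values.
    leading, mass_term = gim_sums(0.22, (0.002, 1.3), 80.4)
    print("mass-independent term:", leading)
    print("mass-dependent term  :", mass_term)
```

The first sum vanishes identically, while the second is proportional to $(m_u^2 - m_c^2)/M_W^2$, which is the suppression described above.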

The above requirements for flavor conservation in neutral couplings determine
to a large extent the representation assignment of the fermion fields. There is a
general theorem, which states that, for a gauge theory based on the group SU(2) x
U(1), the bounds on FCNC are satisfied if we classify (Paschos, 1977; Glashow and
Weinberg, 1977) the quarks into representations of the group in such a way that
quarks of the same charge and the same helicity have the same $T$ (total weak isospin)
and $T_3$ (third component of isospin). For quarks of only two charges (2/3, −1/3)
it implies that there must be equal numbers of up and down quarks.

We illustrate the implications of the result with some examples. 


Example 1 Models with three quarks are not allowed, since they will produce 
strangeness-changing neutral currents. To solve this problem Glashow, Iliopoulos, 
and Maiani (Glashow et al., 1970) introduced a charmed quark. The matrix M is 


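$$M = \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix} \tag{9.24}$$

and the charged current

$$J^+_\mu = (\bar u,\ \bar c,\ \bar d,\ \bar s)\,\gamma_\mu(1-\gamma_5)\begin{pmatrix} 0 & M \\ 0 & 0 \end{pmatrix}\begin{pmatrix} u \\ c \\ d \\ s \end{pmatrix}. \tag{9.25}$$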


Example 2 Models with five quarks, u,c,d,s, and b, produce flavor-changing 
couplings. To eliminate these couplings the top quark was introduced. The charged- 
current interactions are now described by Eqs. (9.13) and (9.14). 


9.3 The elements of the mixing matrix 


There are many processes that determine values for the elements of the mixing 
matrix. They involve products of the weak couplings times hadronic matrix 
elements. Estimates of the latter require methods of strong interactions as they
apply to low or high energies. For the sake of brevity, the description of hadronic 
methods given here is short. The aim is to give a general impression of the 
methods and arrive quickly at the relevant numerical results. The interested 
student will find more details in the references or in the following chapters, 
especially Chapters 11-14. 


9.3.1 Determination of $V_{ud}$


For the weak interaction we introduced in Chapter 2 the Fermi coupling constant 
$G_F$, which is related to the SU(2) coupling by Eq. (8.14). It was also mentioned there
that its numerical value is determined by the muon lifetime. To obtain a precise value 
for $G_F$ it is necessary to include radiative corrections from the exchange of photons
and gauge bosons, as well as the emission of photons. Such diagrams in general 
introduce infinities, which must be treated with special care. The electroweak theory 
is renormalizable and the infinities can be absorbed into a few coupling constants. 
In this book we do not cover the method of renormalization, but refer to an article 
and a book (Sirlin, 1978; Bardin and Passarino, 1999). The precise value for the 
Fermi coupling constant is 


$$G_\mu = (1.16632 \pm 0.00004) \times 10^{-5}\ \mathrm{GeV}^{-2}, \tag{9.26}$$


with the subscript indicating that it is obtained from the muon lifetime. 

The method which was used to construct the hadronic Lagrangian requires that 
the charged currents for quarks have the same coupling constant multiplied by the 
quark mixing matrix, as is seen in Eq. (9.14). This property is called universality. 
Thus the $V_{ud}$ coupling is given by the ratio of the coupling constant measured in
$\beta$-decay, to be denoted by $G_V$, to the muon decay constant:

$$V_{ud} = \frac{G_V}{G_\mu}. \tag{9.27}$$

The most accurate experiments for $\beta$-decay, so far, were done in nuclei and involve
$0^+ \to 0^+$ transitions, also known as superallowed transitions. Their measurements
and analyses have a long history. Precise determination of $G_V$ must include radiative
and, in addition, nuclear corrections. It is beyond the scope of this chapter to describe
the corrections in detail. The result of the analyses is a very precise value,

$$V_{ud} = 0.9740 \pm 0.0003 \pm 0.0015, \tag{9.28}$$


where the first error is statistical and the second represents the theoretical uncer- 
tainty. 
There are two other elementary transitions that are also relevant. The first is pion
$\beta$-decay,

$$\pi^+ \to \pi^0 + e^+ + \nu.$$

The branching ratio for this decay has been measured to be

$$\mathrm{Br}(\pi^+ \to \pi^0 + e^+ + \nu) = (1.025 \pm 0.034) \times 10^{-8}, \tag{9.29}$$

which has a 3% error and is not as accurate as ratios from nuclear $\beta$-decays. The
determination of $V_{ud}$ from this decay has no nuclear corrections but carries a larger
statistical error,

$$V_{ud} = 0.965 \pm 0.016. \tag{9.30}$$


The second elementary transition is the decay of neutrons: $n \to p + e^- + \bar\nu$. This
decay depends both on the vector current and on the axial current, in contrast
to the previous two cases, to which only the vector current contributes. Precise
measurements of the neutron lifetime,

$$\tau_n = 888.5 \pm 0.8\ \mathrm{s}, \tag{9.31}$$

give the value

$$V_{ud} = 0.9801 \pm 0.0030. \tag{9.32}$$


We note that the three determinations are consistent with each other. The most 
accurate one from the superallowed nuclear transitions will be used later on. 


9.3.2 Determination of $V_{us}$


There are several ways to determine $V_{us}$, among which the $K_{\ell 3}$ decays are the
cleanest. Hyperon decays give values that are almost as accurate. For the K-meson
decays we use the reactions

$$K^+ \to \pi^0 e^+ \nu_e \quad\text{and}\quad K^0_L \to \pi^- e^+ \nu_e.$$

The transition is from a pseudoscalar to another pseudoscalar particle and only the
vector current contributes. Its matrix element can be written as

$$\langle \pi(p')|J_\mu|K(p)\rangle = C\left[(p_\mu + p'_\mu)f_+(q^2) + (p_\mu - p'_\mu)f_-(q^2)\right], \tag{9.33}$$

where $q^2 = (p - p')^2$, $C$ is an isospin Clebsch–Gordan coefficient, and $f_\pm(q^2)$
are the form factors. In the SU(3)-symmetry limit at $q^2 = 0$ the form factor
is known: $f_+(0) = 1$. Corrections to this value were computed to account for
symmetry-breaking effects. Then the width

$$\Gamma = \frac{G_\mu^2 M_K^5}{192\pi^3}\,C^2\,|f_+(0)|^2\,|V_{us}|^2 \tag{9.34}$$

determines $V_{us}$. The result including all corrections is

$$V_{us} = 0.220 \pm 0.002. \tag{9.35}$$


The analysis of hyperon decays studies the decays of many hyperons, for which
a $\chi^2$ fit is performed. They involve form factors of both vector and axial currents,
which are assumed to satisfy the SU(3) symmetry. The general fit is an impressive
success of the SU(3) symmetry since there is a single value of $V_{us}$ consistent with all
the hyperon data and the above value. It carries a slightly larger error arising from
the theoretical uncertainties.


9.3.3 Determination of $|V_{cd}|$ and $|V_{cs}|$


One way to obtain the couplings $|V_{cd}|$ and $|V_{cs}|$ is to study the production of charmed
particles in deep inelastic neutrino–nucleon scattering. The particles produced
decay semileptonically and appear as events with opposite-sign dimuons. The
elementary interactions are

$$\nu + d \to \mu^- + c, \qquad c \to \mu^+ + \nu + s, \tag{9.36}$$
$$\bar\nu + \bar d \to \mu^+ + \bar c, \qquad \bar c \to \mu^- + \bar\nu + \bar s. \tag{9.37}$$

The semileptonic decays involve a mixture of charmed particles whose branching
ratio is taken as $B_c = (7.1 \pm 1.3)\%$.

Thus it remains to compute the production rates for charm quarks, which are
discussed in Chapter 11. Here we mention that the original reactions are computed
in the parton model as follows:

$$\sigma(\nu N \to cX) = \frac{G^2 M E}{\pi}\left[r_d(U + D)|V_{cd}|^2 + 2r_s S|V_{cs}|^2\right], \tag{9.38}$$
$$\sigma(\bar\nu N \to \bar cX) = \frac{G^2 M E}{\pi}\left[r_d(\bar U + \bar D)|V_{cd}|^2 + 2r_s \bar S|V_{cs}|^2\right]. \tag{9.39}$$


The r-coefficients measure the suppressions in the production of charmed quarks
due to phase-space restrictions. Estimates for the experiments at energies $E_\nu =
220$ GeV and $E_\nu = 150$ GeV gave the values

$$r_d(220\ \mathrm{GeV}) = 0.91, \qquad r_s(220\ \mathrm{GeV}) = 0.72,$$
$$r_d(150\ \mathrm{GeV}) = 0.70, \qquad r_s(150\ \mathrm{GeV}) = 0.66. \tag{9.40}$$

The capital letters $U, D, \ldots$ denote integrals of the quark distribution functions in
the proton. They are extracted from data on high-energy neutrino–nucleon scattering.

In Eqs. (9.38) and (9.39) we have two equations with two unknowns and thus
we can solve for $|V_{cd}|$ and $|V_{cs}|$. The results are

$$|V_{cd}| = 0.22 \pm 0.03, \tag{9.41}$$
$$|V_{cs}| = 0.75. \tag{9.42}$$
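
Since Eqs. (9.38) and (9.39) are linear in $|V_{cd}|^2$ and $|V_{cs}|^2$, the extraction amounts to solving a 2 x 2 linear system. The sketch below does this with Cramer's rule for cross sections expressed in units of $G^2ME/\pi$. The function names are hypothetical, and the parton integrals used in the round-trip demonstration are made-up illustrative numbers, not measured values.

```python
import math


def reduced_cross_sections(v_cd, v_cs, r_d, r_s, quarks, antiquarks, strange, antistrange):
    """Right-hand sides of Eqs. (9.38) and (9.39) in units of G^2 M E / pi.

    quarks = U + D, antiquarks = Ubar + Dbar, strange = S, antistrange = Sbar.
    """
    sig_nu = r_d * quarks * v_cd ** 2 + 2 * r_s * strange * v_cs ** 2
    sig_nubar = r_d * antiquarks * v_cd ** 2 + 2 * r_s * antistrange * v_cs ** 2
    return sig_nu, sig_nubar


def solve_vcd_vcs(sig_nu, sig_nubar, r_d, r_s, quarks, antiquarks, strange, antistrange):
    """Solve the two linear equations for (|V_cd|, |V_cs|) with Cramer's rule."""
    a11, a12 = r_d * quarks, 2 * r_s * strange
    a21, a22 = r_d * antiquarks, 2 * r_s * antistrange
    det = a11 * a22 - a12 * a21
    if det == 0:
        raise ValueError("the two equations are degenerate")
    v_cd_sq = (sig_nu * a22 - a12 * sig_nubar) / det
    v_cs_sq = (a11 * sig_nubar - a21 * sig_nu) / det
    if v_cd_sq < 0 or v_cs_sq < 0:
        raise ValueError("inputs give a negative squared matrix element")
    return math.sqrt(v_cd_sq), math.sqrt(v_cs_sq)


if __name__ == "__main__":
    # Hypothetical parton integrals, chosen only to exercise the algebra.
    inputs = dict(r_d=0.91, r_s=0.72, quarks=0.50, antiquarks=0.07,
                  strange=0.02, antistrange=0.02)
    sig_nu, sig_nubar = reduced_cross_sections(0.22, 0.97, **inputs)
    print(solve_vcd_vcs(sig_nu, sig_nubar, **inputs))
```

In a real analysis the inputs would carry uncertainties, and the dimuon rates would first have to be corrected for the semileptonic branching ratio $B_c$.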


Another source of information on $|V_{cs}|$ is the semileptonic D-meson decays.
They are proportional to $f_+^D(0)\,|V_{cs}|$ and require estimates of the form factor
$f_+^D(q^2)$.

The values for $|V_{cs}|$ obtained by these methods have large errors due to theoretical
uncertainties. These values have been superseded by measurements of W decays to
identified charmed hadrons and the subsequent decays. W bosons decay to the pairs
$(u\bar q)$ and $(c\bar q)$ with $\bar q = \bar d, \bar s, \bar b$ antiquarks. The sum of the squares of the couplings
for the six decays should add up to the value of 2. Since five of the six couplings
are well measured or very small, LEP measurements can be converted into
a precise value of

$$|V_{cs}| = 0.996 \pm 0.016.$$

Without the use of unitarity the measurements are mutually consistent and give
$0.97 \pm 0.10$. With these values the upper-left-hand corner of the CKM matrix
is known to a high degree of accuracy.


9.3.4 B-Meson decays and the determination of $V_{cb}$ and $V_{ub}$


The relatively long-lived B mesons made possible the determination of two more
elements in the mixing matrix, $V_{cb}$ and $V_{ub}$. The decays of the B mesons proceed in
the spectator model with the decay of b quarks into c and u quarks. The total width
is the incoherent sum of the contributions from the above two decays, corrected, of
course, for the exchange of gluons as described by QCD. The method accounts for
the semileptonic and non-leptonic decays. Both decays were used in determining
$V_{cb}$, while the semileptonic spectrum is used for constraining the element $V_{ub}$. In
these estimates theoretical uncertainties enter the calculation and we shall discuss
them in some detail.

The total width is given by

$$\Gamma_{tot} = \Gamma_0\left(r|V_{ub}|^2 + s|V_{cb}|^2\right), \tag{9.43}$$



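where $r$ and $s$ are products of phase space, color factors, and QCD corrections, and

$$\Gamma_0 = \frac{G^2 m_b^5}{192\pi^3}. \tag{9.44}$$

There are theoretical uncertainties for $r$, $s$, and $\Gamma_0$. For instance, the factor $\Gamma_0$ is
sensitive to the b-quark mass:

$$\frac{1}{\Gamma_0} = \begin{cases} 0.93 \times 10^{-14}\ \mathrm{s} & \text{for } m_b = 5.00\ \mathrm{GeV}, \\ 1.22 \times 10^{-14}\ \mathrm{s} & \text{for } m_b = 4.75\ \mathrm{GeV}. \end{cases} \tag{9.45}$$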
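
The sensitivity to the b-quark mass comes from the fifth power of $m_b$ in $\Gamma_0 = G^2 m_b^5/(192\pi^3)$. The following minimal sketch evaluates $1/\Gamma_0$ in seconds, using the value of $G_\mu$ from Eq. (9.26) and the standard conversion constant $\hbar = 6.582 \times 10^{-25}$ GeV s. The function names are hypothetical. The results agree with the numbers quoted in Eq. (9.45) to within a few percent, the residual difference presumably reflecting the inputs used in the original evaluation.

```python
import math

HBAR_GEV_S = 6.582119569e-25  # hbar in GeV s (conversion from GeV^-1 to seconds)
G_MU = 1.16632e-5             # Fermi constant in GeV^-2, as quoted in Eq. (9.26)


def gamma0(m_b, g_fermi=G_MU):
    """Gamma_0 = G^2 m_b^5 / (192 pi^3) in GeV, for m_b in GeV."""
    return g_fermi ** 2 * m_b ** 5 / (192 * math.pi ** 3)


def inverse_gamma0_seconds(m_b, g_fermi=G_MU):
    """1/Gamma_0 converted to seconds."""
    return HBAR_GEV_S / gamma0(m_b, g_fermi)


if __name__ == "__main__":
    for m_b in (5.00, 4.75):
        print(f"m_b = {m_b:.2f} GeV  ->  1/Gamma_0 = {inverse_gamma0_seconds(m_b):.3e} s")
```

A 5% change in $m_b$ shifts $1/\Gamma_0$ by roughly 30%, which is why the quark mass is a leading theoretical uncertainty in this method.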


The spectator and parton models were used for analyzing, along these lines, the
lepton spectra of semileptonic decays. A consistent analysis determines two more
matrix elements, $V_{cb}$ and $V_{ub}$, with ~20% error.

An alternative method considers exclusive B decays in the heavy-quark effective
theory (HQET). This is a systematic expansion in inverse powers of the heavy-quark
mass. When the mass of the heavy quark is taken to infinity, the decays $B \to D^*\ell\nu$
and $B \to D\ell\nu$ become equal. Experimentally the two branching ratios are different,
so corrections of $O(1/m_Q)$ must be included. Consequently, specific final states are
selected to determine

$$|V_{cb}| = 0.041 \pm 0.002 \tag{9.46}$$

and

$$|V_{ub}| = 0.004 \pm 0.001. \tag{9.47}$$

Some details for the calculations are included in Section 14.5. Finally, the discovery
of the top quark was achieved by observing semileptonic decays that provide an
approximate estimate of $|V_{tb}|$.

With the B-meson decays we close the discussion concerning the elements of 
the CKM matrix, of which six elements are directly determined by experiments. 
Values for the remaining three elements, involving couplings of the top quark, are 
deduced from the unitarity of the matrix. 


9.3.5 Summary and unitarity 


In this chapter we were able to derive accurate values for the matrix elements from 
tree-level constraints. We emphasize that we can determine only their magnitudes, 
not their relative phases. In summary, 


$$|V_{ud}| = 0.9740 \pm 0.0020, \qquad |V_{us}| = 0.220 \pm 0.002,$$
$$|V_{cd}| = 0.22 \pm 0.03, \qquad |V_{cs}| = 0.97 \pm 0.10, \tag{9.48}$$
$$|V_{cb}| = 0.041 \pm 0.002, \qquad |V_{ub}| = 0.004 \pm 0.001.$$

The unitarity of the mixing matrix restricts the matrix elements even further. The
tree constraints together with unitarity give the following ranges reported by the
Particle Data Group (Gilman et al., 2002):

$$|V_{ij}| = \begin{pmatrix} 0.9741\text{--}0.9756 & 0.219\text{--}0.226 & 0.0025\text{--}0.0048 \\ 0.219\text{--}0.226 & 0.9732\text{--}0.9748 & 0.038\text{--}0.044 \\ 0.004\text{--}0.014 & 0.037\text{--}0.044 & 0.9990\text{--}0.9993 \end{pmatrix}. \tag{9.49}$$


The small values for many of the elements justify the small-angle approximation 
and the Wolfenstein parametrization described in Section 9.1. Similarly, one obtains 
values for the other parametrizations as was done in Eq. (9.19). The phase $\delta$ is
still undetermined. Its determination requires measurements of the CP parameters, 
which we postpone until Chapters 15 and 16. 

We are now in a position to test the unitarity of the mixing matrix. There are two 
types of constraints. 


(i) The sum of the squares of absolute values of the elements for each row or each column
must be unity. This can be tested for the first row,

$$|V_{ud}|^2 + |V_{us}|^2 + |V_{ub}|^2 = 0.9970 \pm 0.0036, \tag{9.50}$$

which is consistent with unity. The radiative corrections for the $V_{ud}$ element are
crucial because without them the right-hand side in (9.50) would be greater than unity,
in fact $\sum_i |V_{ui}|^2 = 1.020 \pm 0.004$.

(ii) A convenient and pictorial way to summarize the content of the CKM matrix is in terms
of unitarity triangles. Consider the entries of each row or column of the matrix as the
components of a vector. Then the unitarity condition applied to any two columns states
that the dot product of one column with the complex conjugate of another column
vanishes. For the first and third columns the condition yields

$$V_{ud}V_{ub}^* + V_{cd}V_{cb}^* + V_{td}V_{tb}^* = 0. \tag{9.51}$$

The unitarity triangle is a geometrical representation of this equation in the complex
plane. Each term in the equation is proportional to $A\lambda^3$ and, to leading order,

$$V_{cd}V_{cb}^* \approx -A\lambda^3, \quad V_{ud}V_{ub}^* \approx A\lambda^3(\rho + i\eta), \quad\text{and}\quad V_{td}V_{tb}^* \approx A\lambda^3(1 - \rho - i\eta). \tag{9.52}$$

We can choose to orient the triangle so that $V_{cd}V_{cb}^*$ lies on the x-axis and scale out the
common factor $A\lambda^3$, which is of order 1%. The coordinates of the vertices are then those
shown in Fig. 9.2. The angles $\alpha$, $\beta$, and $\gamma$ of the triangle are also referred to as $\phi_2$, $\phi_1$,
and $\phi_3$, respectively. It is evident from the construction of the triangle that $\beta$ and $\gamma$ are
the phases of the elements $V_{td}$ and $V_{ub}$, respectively:

$$V_{td} = |V_{td}|e^{-i\beta}, \qquad V_{ub} = |V_{ub}|e^{-i\gamma}. \tag{9.53}$$
Figure 9.2. The unitarity triangle.


Once $A\lambda^3$ is factored out, the triangle depends on $\rho$ and $\eta$. Let us select the x-axis to be
$\rho$ and the y-axis $\eta$. The shape of the triangle is now determined by three measurements.
The first constraint comes from the magnitude of $V_{ub}$, which determines a ring centered
at the origin. The CP parameter $\varepsilon_K$ determines a second region. Finally, the mass
difference of the $B_d$ mesons defines another ring; this time its center is at $\rho = 1$ (see
Eqs. (15.53), (16.26) and (16.27)). The three regions are shown in Fig. 9.2, where their
intersection defines the apex of the triangle. All additional measurements that depend
on parameters of the triangle must reproduce the unitarity triangle (see Section 16.5).
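
Both unitarity tests can be sketched numerically. The first function below adds the squared first-row magnitudes and propagates the errors in quadrature; to reproduce the quoted ±0.0036 it is assumed here that the two errors on $V_{ud}$ in Eq. (9.28) are added linearly to 0.0018. The remaining functions build the three terms of the triangle relation in the leading-order Wolfenstein form, taking $V_{cd} \approx -\lambda$, and compute the angles of the rescaled triangle with vertices $(0,0)$, $(1,0)$, and $(\rho,\eta)$. All function names and the Wolfenstein inputs in the demonstration are illustrative assumptions.

```python
import math


def first_row_unitarity(elements):
    """Sum of |V|^2 for a list of (value, error); errors combined in quadrature."""
    total = sum(value ** 2 for value, _ in elements)
    error = math.sqrt(sum((2 * value * err) ** 2 for value, err in elements))
    return total, error


def triangle_terms(a, lam, rho, eta):
    """Leading-order terms V_ud V_ub*, V_cd V_cb*, V_td V_tb* of the triangle relation."""
    scale = a * lam ** 3
    ud_ub = scale * complex(rho, eta)
    cd_cb = complex(-scale, 0.0)
    td_tb = scale * complex(1 - rho, -eta)
    return ud_ub, cd_cb, td_tb


def triangle_angles(rho, eta):
    """Angles (alpha, beta, gamma) in radians for the apex (rho, eta)."""
    gamma = math.atan2(eta, rho)
    beta = math.atan2(eta, 1 - rho)
    alpha = math.pi - beta - gamma
    return alpha, beta, gamma


if __name__ == "__main__":
    row = [(0.9740, 0.0018), (0.220, 0.002), (0.004, 0.001)]
    print("first row:", first_row_unitarity(row))
    # Illustrative Wolfenstein parameters, not fitted values.
    print("closure  :", abs(sum(triangle_terms(0.8, 0.22, 0.2, 0.35))))
    print("angles   :", [math.degrees(x) for x in triangle_angles(0.2, 0.35)])
```

The closure check shows that the three complex numbers add to zero for any choice of $\rho$ and $\eta$, so the experimental content of the triangle lies entirely in the position of its apex.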


We have mentioned already that CP violation is attributed to the phase in the CKM
matrix. Quantitative predictions for CP asymmetries always contain

$$s_1 s_2 s_3 \sin\delta \quad\text{or}\quad s_\beta s_\gamma s_\theta \sin\delta' \tag{9.54}$$

as a multiplicative factor. For three generations of quarks there is a rephasing-invariant
measure of CP violation. In terms of the elements, it is given by

$$J_{i\alpha} = \mathrm{Im}\{V_{j\beta}V_{k\gamma}(V_{j\gamma}V_{k\beta})^*\}, \tag{9.55}$$

where $i, j, k$ and $\alpha, \beta, \gamma$ are cyclic permutations of 1, 2, 3, i.e. once we give
numerical values to $i$ and $\alpha$, the other indices are determined (Jarlskog, 1985).
There are nine such invariants, which are all equal to each other. Their explicit
form in the Maiani parametrization is

$$J_{i\alpha} \approx s_\beta s_\gamma s_\theta \sin\delta'. \tag{9.56}$$
For the central values of the angles 


Jig © 2.5 x 107459 sin ô’. (9.57) 
The smallness of this quantity implies that CP parameters will in general be small.
There are also exceptions to this rule, which happen when the CP-violating quantity,
which is proportional to $J_{i\alpha}$, is divided by another small quantity. It is evident from
this discussion that the CP asymmetries manifest themselves in two ways: 


(i) processes in which the rates are large have small asymmetries; and 
(ii) large asymmetries occur for observables when the branching ratios are small. 


Both situations appear in K- and B-meson decays, for which CP asymmetries have 
been observed. 

Beyond the estimates of CKM elements discussed in this chapter, there are 
additional limits from observations related to loop diagrams. The theoretical anal- 
yses are now more complicated and involve additional theoretical assumptions. 
The advantage, however, is that they investigate the quantum nature of the theory 
and lead to a consistent picture. In fact, there are additional checks for the angles 
(CP phases) of the unitarity triangle. We shall cover several of these exciting topics 
in later chapters of the book. 
Part III


Experimental consequences and comparisons 


10 


Deep inelastic scattering 


10.1 Kinematics for deep inelastic scattering 


The processes that we are studying in the next few sections are shown schematically 
in Fig. 10.1. An initial neutrino with energy E hits a proton, producing a final state 
of a muon with energy E’ and an undetected final hadronic state. 

The lepton vertex is well known. All the interesting structure is included in the 
hadronic vertex. The kinematics are shown in the diagram and they involve 


$k_\mu$, the four-vector of the neutrino,

$k'_\mu$, the four-vector of the muon,

$q = k - k'$, the four-momentum transferred from leptons to hadrons,

$p$, the four-momentum of the target nucleon,

$\nu = p\cdot q/M$, the energy transfer in the laboratory frame,

$\theta$, the laboratory angle of the muon produced relative to the incident neutrino, and

$Q^2 = -q^2 = -m_\mu^2 + 2E(E' - k'\cos\theta) \approx 4EE'\sin^2(\theta/2)$, with $k' = \sqrt{E'^2 - m_\mu^2}$.
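
As a concrete illustration of these definitions, the following minimal sketch computes the invariants for a charged-current event from the laboratory quantities $E$, $E'$, and $\theta$. It also returns the scaling variable $x = Q^2/(2M\nu)$ and the inelasticity $y = \nu/E$ that are used later in the chapter. The function name, the dictionary keys, and the default masses (nucleon 0.938 GeV, muon 0.10566 GeV) are choices made here for illustration.

```python
import math

M_NUCLEON = 0.938   # GeV
M_MUON = 0.10566    # GeV


def dis_kinematics(e_nu, e_mu, theta, m_target=M_NUCLEON, m_mu=M_MUON):
    """Invariants for nu + N -> mu + X from lab energies (GeV) and muon angle (rad)."""
    k_mu = math.sqrt(e_mu ** 2 - m_mu ** 2)            # muon three-momentum
    q2_exact = -m_mu ** 2 + 2 * e_nu * (e_mu - k_mu * math.cos(theta))
    q2_approx = 4 * e_nu * e_mu * math.sin(theta / 2) ** 2
    nu = e_nu - e_mu                                    # p.q / M in the lab frame
    return {
        "q2_exact": q2_exact,
        "q2_approx": q2_approx,
        "nu": nu,
        "x": q2_exact / (2 * m_target * nu),
        "y": nu / e_nu,
    }


if __name__ == "__main__":
    for key, value in dis_kinematics(100.0, 60.0, 0.05).items():
        print(f"{key:10s} {value:.5f}")
```

At these energies the massless approximation for $Q^2$ differs from the exact expression at the level of $10^{-3}$, which illustrates why the muon mass is neglected in what follows.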


The above definitions hold also for electroproduction, when the initial neutrino is 
replaced by an electron and the exchange particle is the photon. The discussion of 
this and the following section is restricted to neutrino reactions. The cross section 
for such a process in the rest frame of the proton is given by 


$$d\sigma = \frac{1}{(2E)(2M)}\int_n |\mathcal{M}|^2\,(2\pi)^4\delta^4(k + p - k' - p_n)\,\frac{d^3k'}{2E'(2\pi)^3}, \tag{10.1}$$

where

$$p_n = \sum_{i=1}^{n} p_i,$$

with the summation over all final particle configurations, each of which contains $n$
particles with momenta $p_i$ and $i = 1, \ldots, n$. The integration over the phase space
of final-state particles and the summation over the configurations is given by

$$\int_n = \sum_n \prod_{i=1}^{n}\int\frac{d^3p_i}{2E_i(2\pi)^3}.$$

Figure 10.1. Inelastic neutrino–nucleon scattering, together with the coordinate
system used in decomposing the leptonic current.


Later on we shall specify the final state to be a single quark, with the product 
reduced to a single phase-space factor. 
The matrix element is 


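$$\mathcal{M} = \frac{G}{\sqrt{2}}\,\bar u(k')\gamma_\mu(1 - \gamma_5)u(k)\,\langle p_n|J^\mu|p\rangle. \tag{10.2}$$

We write the leptonic current as

$$j_\mu^{\rm lept} = \bar u(k')\gamma_\mu(1 - \gamma_5)u(k). \tag{10.3}$$

Neglecting the muon mass, the current is evaluated by multiplying it by a simple
factor (Bjorken and Paschos, 1970):

$$\frac{\sum_{s,s'}\bar u(k,s)\gamma_0(1 - \gamma_5)u(k',s')}{\bar u(k,s)\gamma_0(1 - \gamma_5)u(k',s')}. \tag{10.4}$$

The factor of unity is introduced in order to change the numerator into a trace;
the summation over spins does not change the lepton current because $(1 - \gamma_5)$ is a
chirality-projection operator, so the extra states introduced by $\sum_{s,s'}$ contribute zero:

$$j_\mu^{\rm lept} = \frac{2\,\mathrm{Tr}[\gamma_\mu(1 - \gamma_5)\not{k}\gamma_0\not{k}']}{\{2\,\mathrm{Tr}[(1 - \gamma_5)\not{k}\gamma_0\not{k}'\gamma_0]\}^{1/2}} = \frac{8\left(k_\mu E' + k'_\mu E - g_{\mu 0}\,k\cdot k' + i\epsilon_{\mu 0\alpha\beta}k^\alpha k'^\beta\right)}{4\sqrt{EE'}\cos(\theta/2)}. \tag{10.5}$$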
Similarly, we calculate the square of the denominator, which produces a trace.

Using current conservation, we can eliminate one of the components in $j_\mu$ and
expand the current in terms of three orthonormal polarization vectors whose spatial
components lie along the axes shown in Fig. 10.1; the z-axis lies along $\mathbf{q}$. This
decomposition simplifies considerably in the high-energy limit $\nu \gg 2M \approx 2$ GeV,
$Q^2 \ll \nu^2$, which is all we consider in this chapter. An alternative way would be
to square the leptonic current and compute the leptonic tensor. The method is
straightforward and the interested student can use it in order to reproduce some of
the formulas in Sections 10.2 and 10.3. Here we find the method convenient for
introducing helicity cross sections.

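The three polarization vectors below correspond to the angular-momentum state
$|J = 1, m\rangle$ with helicities $m = 0$, $1$, and $-1$, respectively:

$$\varepsilon^S_\mu = \frac{1}{\sqrt{Q^2}}\left(|\mathbf{q}|, 0, 0, q_0\right) = \frac{1}{\sqrt{Q^2}}\left(\sqrt{Q^2 + \nu^2}, 0, 0, \nu\right),$$
$$\varepsilon^R_\mu = \frac{1}{\sqrt{2}}(0, 1, i, 0), \tag{10.6}$$
$$\varepsilon^L_\mu = \frac{1}{\sqrt{2}}(0, 1, -i, 0).$$

They satisfy the conditions $\varepsilon_S^2 = +1$, $|\varepsilon_{L,R}|^2 = -1$, and $\varepsilon_{S,L,R}\cdot q = 0$. In the high-energy
approximation the current, evaluated in the laboratory frame, becomes

$$j_\mu^{\rm lept} \approx \frac{4\sqrt{EE'Q^2}}{\nu}\left[\varepsilon^S_\mu + \left(\frac{E'}{2E}\right)^{1/2}\varepsilon^R_\mu + \left(\frac{E}{2E'}\right)^{1/2}\varepsilon^L_\mu\right]. \tag{10.7}$$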

The only change in $j_\mu^{\rm lept}$ on going over to antineutrino-induced processes is the
interchange $R \leftrightarrow L$.

The integration over the phase space of the muon can be carried out,

$$\frac{d^3k'}{2E'(2\pi)^3} = \frac{E'\,dE'\,d\Omega}{2(2\pi)^3}.$$

In addition, we can transform to an invariant phase-space element,

$$\frac{d\sigma}{dQ^2\,d\nu} = \frac{\pi}{EE'}\,\frac{d\sigma}{d\Omega\,dE'},$$

arriving at

$$\frac{d\sigma}{dQ^2\,d\nu} = \frac{G^2}{8\pi^2}\,\frac{E'}{E}\,\frac{Q^2}{\nu^2}\,\frac{1}{M}\int_n \left|\langle n|\varepsilon^{\rm lept}_\mu J^\mu|p\rangle\right|^2 (2\pi)^4\delta^4(p_n - p - q). \tag{10.8}$$

Here

$$\varepsilon^{\rm lept}_\mu = \varepsilon^S_\mu + \left(\frac{E'}{2E}\right)^{1/2}\varepsilon^R_\mu + \left(\frac{E}{2E'}\right)^{1/2}\varepsilon^L_\mu.$$

It is evident now that the amplitude $\langle n|j_\mu J^\mu|p\rangle$ is the sum of three helicity
amplitudes: scalar ($A_S$), right-handed ($A_R$), and left-handed ($A_L$). The cross section
is the sum of three helicity cross sections and three interference terms. When we
average over the azimuthal angles of the hadrons produced, the three interference
terms average to zero, as indicated by the following argument.

Let $\Gamma$ be a fixed set of final-state hadron momenta that are measured. Let $\Gamma' = R\Gamma$
be the set of momenta obtained by rigid rotation of $\Gamma$ about $\mathbf{q}$ (the z-axis) by the
angle $\phi$. We kept the neutrino and muon momenta fixed and rotated the hadronic
system. This is equivalent to keeping the hadrons fixed and rotating the neutrino–muon
plane in the opposite direction. Under this rotation the only change in the
cross section is to replace $\varepsilon^{\rm lept}_\mu$ as follows:

$$\varepsilon^{\rm lept}_\mu \to \varepsilon^S_\mu + \left(\frac{E'}{2E}\right)^{1/2}\varepsilon^R_\mu e^{i\phi} + \left(\frac{E}{2E'}\right)^{1/2}\varepsilon^L_\mu e^{-i\phi}. \tag{10.9}$$

The rotation is equivalent to a rotation of the two polarization vectors $\varepsilon_{R,L}$ around the
z-axis. $\varepsilon_S$, whose spatial part is parallel to $\mathbf{q}$, does not change. Accordingly, only the
interference terms $2\,\mathrm{Re}(A_S A_R^* e^{-i\phi})$, $2\,\mathrm{Re}(A_S A_L^* e^{i\phi})$, and $2\,\mathrm{Re}(A_R A_L^* e^{2i\phi})$ change.
They produce terms linear in $\cos\phi$ and $\sin\phi$ and, for the last one, in $\cos 2\phi$ and $\sin 2\phi$.
By averaging over the azimuthal orientations of the final hadrons, i.e. integrating over
$\phi$ from 0 to $2\pi$, the interference terms are made to vanish. Should one wish to isolate
the interference terms, then it is necessary to construct appropriate moments over the
angle $\phi$.

To sum up, $\Gamma$ denotes a set of hadronic momenta in the final state, whose angles
relative to each other are kept fixed; the rigid rotation around $\mathbf{q}$ has been averaged,
i.e. integrated out. In this manner only helicity cross sections survive.
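
The vanishing of the interference terms can be checked numerically. In the sketch below the three helicity amplitudes are arbitrary complex numbers, the rotation multiplies the right- and left-handed amplitudes by $e^{i\phi}$ and $e^{-i\phi}$, and the azimuthal average is taken on a uniform grid, which is exact for the low-order trigonometric terms that appear. The function names and the sample amplitudes are illustrative assumptions.

```python
import cmath
import math


def rotated_rate(a_s, a_r, a_l, phi):
    """|A_S + A_R e^{i phi} + A_L e^{-i phi}|^2 for a rigid rotation by phi."""
    amplitude = a_s + a_r * cmath.exp(1j * phi) + a_l * cmath.exp(-1j * phi)
    return abs(amplitude) ** 2


def azimuthal_average(a_s, a_r, a_l, n_points=16):
    """Average of the rate over phi in [0, 2 pi) on a uniform grid.

    A uniform grid with more than two points integrates cos(phi), sin(phi),
    cos(2 phi) and sin(2 phi) to zero exactly.
    """
    total = sum(
        rotated_rate(a_s, a_r, a_l, 2 * math.pi * k / n_points)
        for k in range(n_points)
    )
    return total / n_points


if __name__ == "__main__":
    a_s, a_r, a_l = 1 + 2j, 0.5 - 1j, -0.3 + 0.7j   # arbitrary sample amplitudes
    incoherent = abs(a_s) ** 2 + abs(a_r) ** 2 + abs(a_l) ** 2
    print("rate at phi = 0.3 :", rotated_rate(a_s, a_r, a_l, 0.3))
    print("azimuthal average :", azimuthal_average(a_s, a_r, a_l))
    print("incoherent sum    :", incoherent)
```

At a fixed angle the rate differs from the incoherent sum, while the average reproduces it, which is the statement that only helicity cross sections survive.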

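We define the helicity cross sections for absorption of the “virtual” W on a nucleon
into final hadronic states by

$$\sigma_\lambda(\nu, Q^2) = \frac{1}{4M\nu}\int \left|\langle p'|\varepsilon^\lambda_\mu J^\mu|p\rangle\right|^2 (2\pi)^4\delta^4(p' - p - q)\,\frac{d^3p'}{2E_{p'}(2\pi)^3}, \qquad \lambda = S, R, L. \tag{10.10}$$

Here we have assumed that there is only one particle in the final hadronic state.
When there are many particles produced, the phase space is replaced by a product
of phase-space factors. These cross sections depend only on $\nu$ and $Q^2$. The final
formula reads

$$\frac{d\sigma}{dQ^2\,d\nu} = \frac{G^2}{4\pi^2}\,\frac{E'}{E}\,\frac{Q^2}{\nu}\left[2\sigma_S + \frac{E}{E'}\sigma_L + \frac{E'}{E}\sigma_R\right]. \tag{10.11}$$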
dQ?dv 4m? E v ee Be 
The absorption cross sections are not uniquely defined; for $q^2 = 0$ the flux factor
for the exchanged particle is $2\nu$. This is a convention and sometimes the factor
has been replaced by $2\nu[1 - Q^2/(2M\nu)]$. To avoid the zeros which appear for
elastic scattering, we chose the overall factor in the helicity cross sections $F = 4M\nu$
(Eq. (10.10)).

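On introducing the structure function

$$W_2(\nu, Q^2) = \frac{1}{2\pi}\,\frac{Q^2}{\nu}\,(2\sigma_S + \sigma_R + \sigma_L) \tag{10.12}$$

and the ratios

$$(L) = \frac{\sigma_L}{2\sigma_S + \sigma_R + \sigma_L} \le 1, \qquad (R) = \frac{\sigma_R}{2\sigma_S + \sigma_R + \sigma_L} \le 1, \tag{10.13}$$

it follows that

$$\frac{d\sigma}{dQ^2\,d\nu} = \frac{G^2}{2\pi}\,\frac{E'}{E}\,W_2(\nu, Q^2)\left[1 + \frac{\nu}{E'}(L) - \frac{\nu}{E}(R)\right]. \tag{10.14}$$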

The antineutrino–nucleon cross section is obtained from (10.14) by the interchange
$L \leftrightarrow R$. We shall find these cross sections useful in several applications later on.

An alternative notation introduces the structure functions $W_1(\nu, Q^2)$, $W_2(\nu, Q^2)$,
and $W_3(\nu, Q^2)$ defined in the next section. They are related to the absorption cross
sections through (10.12) and the following:

$$W_1(\nu, Q^2) = W_2(\nu, Q^2)\left(1 + \frac{\nu^2}{Q^2}\right)[(L) + (R)], \tag{10.15}$$
$$W_3(\nu, Q^2) = W_2(\nu, Q^2)\,\frac{2M}{\sqrt{Q^2}}\left(1 + \frac{\nu^2}{Q^2}\right)^{1/2}[(L) - (R)]. \tag{10.16}$$

In the limit $\nu^2/Q^2 \gg 1$ they reduce to

$$W_1(\nu, Q^2) = \nu W_2(\nu, Q^2)\,\frac{\nu}{Q^2}\,[(L) + (R)], \tag{10.17}$$
$$\nu W_3(\nu, Q^2) = \nu W_2(\nu, Q^2)\,\frac{2M\nu}{Q^2}\,[(L) - (R)]. \tag{10.18}$$

10.2 Hadronic structure functions 


In the previous section we introduced structure functions that describe the hadronic 
vertex. Here we describe their connection with products of currents and their com- 
mutators. The formalism of this section is convenient in discussing sum rules or the 
light-cone behavior of the product of weak currents. We define the hadronic tensor
as

$$W_{\mu\nu} = (2\pi)^3\,\overline{\sum_s}\sum_n \langle p|J^\dagger_\mu(0)|p_n\rangle\langle p_n|J_\nu(0)|p\rangle\,\delta^4(p_n - p - q). \tag{10.19}$$

Here $\sum_n$ sums over final states, and $\overline{\sum}_s$ averages over the spins of the target
nucleon. By exponentiating the delta function and using translation invariance,

$$J_\mu(x) = e^{i\hat P\cdot x}J_\mu(0)e^{-i\hat P\cdot x}, \tag{10.20}$$

one obtains

$$W_{\mu\nu} = \frac{1}{2\pi}\int d^4x\,e^{iq\cdot x}\langle p|J^\dagger_\mu(x)J_\nu(0)|p\rangle, \tag{10.21}$$

where the completeness relation $\sum_n |p_n\rangle\langle p_n| = 1$ was used. We may change $W_{\mu\nu}$ into a
commutator,

$$W_{\mu\nu} = \frac{1}{2\pi}\int d^4x\,e^{iq\cdot x}\langle p|[J^\dagger_\mu(x), J_\nu(0)]|p\rangle, \tag{10.22}$$

since the second term of the commutator,

$$-\frac{1}{2\pi}\int d^4x\,e^{iq\cdot x}\langle p|J_\nu(0)J^\dagger_\mu(x)|p\rangle, \tag{10.23}$$

vanishes. This is proven by reversing the steps and showing that (10.23) reduces to

$$\sum_n \langle p|J_\nu(0)|p_n\rangle\langle p_n|J^\dagger_\mu(0)|p\rangle\,\delta^4(p_n - p + q) = 0, \tag{10.24}$$

since in the physical process $q_0 = E_n - M_{\rm proton} > 0$, but in Eq. (10.24) the $\delta$-function
argument implies

$$q_0 = p_0 - p_n^0 = M - E_n < 0.$$

By virtue of Lorentz and gauge invariance, $W_{\mu\nu}$ can be written in terms of
six scalar functions, which are better known as structure functions. In neutrino
scattering, however, only three contribute to the inelastic cross section because
the lepton current is conserved (for $m_\mu = 0$). The tensor relevant to deep inelastic
scattering is

$$W_{\mu\nu} = -g_{\mu\nu}W_1 + \frac{p_\mu p_\nu}{M^2}W_2 - \frac{i\epsilon_{\mu\nu\alpha\beta}\,p^\alpha q^\beta}{2M^2}W_3, \tag{10.25}$$

where the structure functions $W_1(Q^2, \nu)$ and $W_2(Q^2, \nu)$ arise from the product
of vector $\otimes$ vector currents and axial $\otimes$ axial currents, whereas $W_3(Q^2, \nu)$ is the
interference of an axial current $\otimes$ a vector current. The additional three terms are

$$\frac{q_\mu q_\nu}{M^2}W_4 + \frac{p_\mu q_\nu + p_\nu q_\mu}{2M^2}W_5 + i\,\frac{p_\mu q_\nu - p_\nu q_\mu}{2M^2}W_6.$$

Their contributions to the matrix elements and the cross section are proportional to
lepton masses and will be neglected.


10.3 Scaling and the total cross section 


The structure functions are functions of $\nu$ and $Q^2$, but at high energies both variables
are very large. It was suggested by Bjorken (1969) that, in the limit $\nu \to \infty$, $Q^2 \to
\infty$, with the ratio

$$x = \frac{Q^2}{2M\nu} = \text{finite}, \tag{10.26}$$

the structure functions become functions of $x$ only, i.e.

$$\nu W_{2,3}(\nu, Q^2) \to F_{2,3}(x), \tag{10.27}$$
$$M W_1(\nu, Q^2) \to F_1(x). \tag{10.28}$$

This was established in experiments on deep inelastic electron–proton scattering,
for which the limit is reached at relatively low values of $Q^2$, $2M\nu \gtrsim (1\ \mathrm{GeV})^2$.
Inelastic electron–proton scattering is closely related to neutrino reactions and we
mention it later on in this section.

In the scaling limit the relations (10.17) and (10.18) reduce to

$$2xF_1(x) = F_2(x)[(L) + (R)],$$
$$xF_3(x) = F_2(x)[(L) - (R)].$$


With this notation we can rewrite the cross section in a convenient form. For variables
we use $x = Q^2/(2M\nu)$ and the inelasticity $y = \nu/E$, then we substitute the
scaling functions into the cross section, Eq. (10.14), and change the phase-space
variables to arrive at

$$\frac{d\sigma}{dx\,dy} = \frac{G^2ME}{\pi}\left[xy^2F_1(x) + (1 - y)F_2(x) + y\left(1 - \frac{y}{2}\right)xF_3(x)\right],$$

where the structure functions depend on the process under consideration. In order
to obtain the corresponding cross section for an antineutrino-induced reaction, one
should change the sign of the $F_3$ term and replace the structure functions with the
charge conjugate.
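
The change of variables can be verified numerically. Using $dQ^2\,d\nu = 2ME\nu\,dx\,dy$ and $E'/E = 1 - y$, the helicity form of Eq. (10.14) becomes $F_2[(1 - y) + y(L) - y(1 - y)(R)]$ in units of $G^2ME/\pi$, and this must agree with the structure-function form once $2xF_1 = F_2[(L) + (R)]$ and $xF_3 = F_2[(L) - (R)]$ are inserted. The sketch below checks this for sample inputs; the function names and the numbers are illustrative assumptions.

```python
def structure_from_helicity(x, f2, l_ratio, r_ratio):
    """Return (F1, F3) from F2 and the helicity ratios (L), (R) in the scaling limit."""
    f1 = f2 * (l_ratio + r_ratio) / (2 * x)
    f3 = f2 * (l_ratio - r_ratio) / x
    return f1, f3


def dsigma_helicity(y, f2, l_ratio, r_ratio):
    """d(sigma)/dx dy in units of G^2 M E / pi, written with helicity ratios."""
    return f2 * ((1 - y) + y * l_ratio - y * (1 - y) * r_ratio)


def dsigma_structure(x, y, f1, f2, f3):
    """d(sigma)/dx dy in units of G^2 M E / pi, written with F1, F2, F3."""
    return x * y ** 2 * f1 + (1 - y) * f2 + y * (1 - y / 2) * x * f3


if __name__ == "__main__":
    x, y, f2, l_ratio, r_ratio = 0.3, 0.4, 1.2, 0.7, 0.2   # sample values only
    f1, f3 = structure_from_helicity(x, f2, l_ratio, r_ratio)
    print(dsigma_helicity(y, f2, l_ratio, r_ratio))
    print(dsigma_structure(x, y, f1, f2, f3))
```

The two expressions agree for every $x$ and $y$, so either set of functions can be used to parametrize the data.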

The main difference between electroproduction and neutrino-induced reactions 
is the nature of the particle exchanged. In electroproduction the particle exchanged 
is the photon, which has only a vector coupling

$$j_\mu^{\rm em} = \bar u(k')\gamma_\mu u(k). \tag{10.29}$$

The vector–axial interference term is now absent and the cross sections $\sigma_R$ and $\sigma_L$
are equal. Following steps similar to those of the previous section, one finds

$$\frac{d\sigma}{dQ^2\,d\nu} = \frac{E'}{E}\,\frac{4\pi\alpha^2}{Q^4}\left[W_2^e\cos^2\left(\frac{\theta}{2}\right) + 2W_1^e\sin^2\left(\frac{\theta}{2}\right)\right], \tag{10.30}$$

where $W_1^e$ and $W_2^e$ are electroproduction structure functions analogous to those
introduced in Eq. (10.25). The superscript $e$ indicates their electromagnetic
origin. In the cross section we kept the scattering angle $\theta$. However, we can
express it in terms of $Q^2$ and the energies $E$ and $E'$ and arrive at
a formula analogous to (10.14). Numerous experiments have shown that the limits

$$\nu W_2^e(\nu, Q^2) \to F_2^e(x), \tag{10.31}$$
$$M W_1^e(\nu, Q^2) \to F_1^e(x) \tag{10.32}$$

are reached for relatively low values of $\nu$ and $Q^2$. This is shown in Fig. 10.2, where
the structure function $F_2(x)$ is plotted for a range of $Q^2$. Deviations from the scaling
law have also been established, and we return to this topic in Chapter 11. We show
next that scaling predicts $\sigma_{\rm tot} \sim E_\nu$, namely a linear rise with neutrino energy.
From (10.14) and scale invariance (10.27) we find the averaged cross section
over protons and neutrons

$$\sigma = \frac{G^2}{2\pi}\int_0^E d\nu\int_0^{2M\nu}dQ^2\,\frac{E'}{E}\,W_2\left[1 + \frac{\nu}{E'}(L) - \frac{\nu}{E}(R)\right]
= \frac{G^2M}{\pi}\int_0^E d\nu\,\frac{E'}{E}\left[1 + \frac{\nu}{E'}(L) - \frac{\nu}{E}(R)\right]\int_0^1 dx\,F_2(x), \tag{10.33}$$

where (R) and (L) imply that the appropriate averages over $x$ have been taken. Then
the total cross section is

$$\sigma_{\rm tot} = \frac{G^2ME}{\pi}\int_0^1 dx\,F_2(x)\left\{\frac{1}{2} + \frac{(L)}{2} - \frac{(R)}{6}\right\}. \tag{10.34}$$

The factor in the curly brackets lies between 1 and 1/3. In particular,

$$\left\{\frac{1}{2} + \frac{(L)}{2} - \frac{(R)}{6}\right\} = \begin{cases} 1 & \text{if } \sigma_R = \sigma_S = 0, \\ 2/3 & \text{if } \sigma_R = \sigma_L,\ \sigma_S = 0, \\ 1/2 & \text{if } \sigma_R = \sigma_L = 0, \\ 1/3 & \text{if } \sigma_L = \sigma_S = 0. \end{cases} \tag{10.35}$$

Figure 10.2. Scaling of the structure function $\nu W_2 = F_2(x)$.


From (10.33) we see that a linear rise in $\sigma_{\rm tot}$ depends on the property that $\nu W_2$ is
scale-invariant and on the absence of a W propagator. The neutrino measurements give

$$\sigma^{\nu} = (0.677 \pm 0.014) \times 10^{-38}\ \mathrm{cm}^2\ \frac{E_\nu}{\mathrm{GeV}}.$$

We can also compare neutrino and antineutrino cross sections on isoscalar targets.
The ratio $\sigma^{\bar\nu}/\sigma^{\nu}$ is bounded between 1/3 and 3. The experimental data give

$$\sigma^{\bar\nu} = (0.334 \pm 0.008) \times 10^{-38}\ \mathrm{cm}^2\ \frac{E_\nu}{\mathrm{GeV}},$$

with the ratio of the two slopes being $0.501 \pm 0.015$, which is consistent with the
above prediction and close to the lower bound.
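
The limiting values of the y-integrated factor and the bounds on the antineutrino-to-neutrino ratio can be reproduced with exact rational arithmetic. The sketch below takes the factor $\{1/2 + (L)/2 - (R)/6\}$ for neutrinos and obtains the antineutrino factor by the interchange $L \leftrightarrow R$; it assumes that the integral of $F_2$ is the same for both beams on an isoscalar target. The function names are hypothetical.

```python
from fractions import Fraction


def helicity_ratios(sigma_s, sigma_r, sigma_l):
    """Return ((L), (R)) for integer or Fraction helicity cross sections."""
    total = 2 * sigma_s + sigma_r + sigma_l
    return Fraction(sigma_l, total), Fraction(sigma_r, total)


def neutrino_factor(l_ratio, r_ratio):
    """The y-integrated factor 1/2 + (L)/2 - (R)/6 for neutrinos."""
    return Fraction(1, 2) + Fraction(l_ratio) / 2 - Fraction(r_ratio) / 6


def antineutrino_factor(l_ratio, r_ratio):
    """Same factor for antineutrinos, obtained by interchanging L and R."""
    return neutrino_factor(r_ratio, l_ratio)


if __name__ == "__main__":
    cases = {
        "sigma_R = sigma_S = 0": (0, 0, 1),
        "sigma_R = sigma_L, sigma_S = 0": (0, 1, 1),
        "sigma_R = sigma_L = 0": (1, 0, 0),
        "sigma_L = sigma_S = 0": (0, 1, 0),
    }
    for label, (s, r, l) in cases.items():
        ratios = helicity_ratios(s, r, l)
        nu, nubar = neutrino_factor(*ratios), antineutrino_factor(*ratios)
        print(f"{label:32s} nu: {nu}  nubar: {nubar}  ratio: {nubar / nu}")
```

Pure left-handed absorption gives the ratio 1/3 and pure right-handed absorption gives 3, so a measured ratio near 0.5 indicates that the left-handed cross section dominates.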
10.4 The parton model


“Friends rush in where angels fear to tread.” 
(R. P. Feynman, at Fermilab, 1973) 


A physical interpretation of the scaling phenomenon is given by the parton model, 
which considers the scattering as the incoherent sum of scattering from point- 
like constituents within the proton, called partons. The point-like nature of the 
constituents reproduces scaling. By studying several reactions it was possible to 
deduce properties of the constituents, such as electric charge, and identify the 
partons with quarks. The parton model has been applied to a wide range of high- 
energy reactions, many of which will be covered in this chapter. Deep inelastic 
reactions together with hadron spectroscopy supply the major evidence for the 
quark substructure of matter. 


Neutrino–nucleon scattering


The basic idea in the parton model is to regard the deep inelastic scattering as quasi- 
free scattering from point-like constituents within the proton. This happens when the 
scattering is viewed from a frame in which the proton has infinite momentum. The 
neutrino—proton center-of-mass system is, at high energies, a good approximation 
of such a frame. In the infinite-momentum frame, the proton is Lorentz-contracted 
into a thin pancake, and the lepton scatters instantaneously. Furthermore, the proper 
motion of the constituents within the proton is slowed down by time dilatation. We 
estimate the interaction time and lifetime of the virtual states within the proton. In 
the notation of the previous section and Fig. 10.3, the initial electron and proton 
are collinear and in opposite directions: 


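$$\mathbf{k} = -\mathbf{P}, \qquad k_0 \approx P_0 = P. \qquad (10.36)$$

In this frame

$$p \cdot q = M\nu = (q_0 + q_z)P, \qquad (10.37)$$
$$k \cdot q = -Q^2/2 = (q_0 - q_z)P, \qquad (10.38)$$

from which it follows that

$$q_0 = \frac{2M\nu - Q^2}{4P}. \qquad (10.39)$$

The time of interaction is $\tau \approx 1/q_0$, which for moderate values of $x$ decreases as

$$\tau \approx \frac{4P}{2M\nu(1 - x)}. \qquad (10.40)$$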
Figure 10.3. Kinematics for neutrino–nucleon scattering in the parton model.


We visualize the proton as composed of virtual states called partons. We denote by 
x the fraction of the proton’s momentum carried by a constituent. The lifetime of 
the virtual states (Feynman, 1969; Bjorken and Paschos, 1969) is 


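$$T = \frac{1}{E_1 + E_2 - E_P}
   = \frac{1}{\sqrt{(xP)^2 + \mu_1^2 + P_T^2} + \sqrt{(1 - x)^2P^2 + \mu_2^2 + P_T^2} - \sqrt{P^2 + M^2}}$$
$$\approx \frac{2P}{(\mu_1^2 + P_T^2)/x + (\mu_2^2 + P_T^2)/(1 - x) - M^2}. \qquad (10.41)$$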


If we now require that $\tau \ll T$, then we may consider the partons, contained in
the proton, as free during the interaction. In this limit the current interacts with
just one of the constituents, leaving the rest undisturbed, thus making the impulse
approximation valid. The above conditions appear to be satisfied in high-energy and
large-momentum-transfer electron–nucleon scattering and in high-energy
neutrino–nucleon scattering. The model could fail for $x \to 0$ or 1, for
which the expansion in (10.39) is no longer justified. The reader may have noticed 
that we use x with two meanings: the first one is Bjorken’s variable x defined in 
Eq. (10.26) and the second is the fraction of the proton’s momentum. This was done 
on purpose because the two variables are the same. 

The cross section of a proton is the incoherent sum of cross sections of the individual
constituents. We denote by $d\sigma_i(x)/(dQ^2\,d\nu)$ the cross section of a neutrino
on a parton of type $i$, which carries a fraction $x$ of the proton's momentum,

$$p_i^\mu \approx xP^\mu. \qquad (10.42)$$

We denote by $f_i(x)$ the probability of finding the $i$th constituent carrying a fraction
$x$ of the proton's momentum. Then the cross section is

$$\frac{d\sigma}{dQ^2\,d\nu} = \sum_i \int_0^1 dx\, f_i(x)\,\frac{d\sigma_i(x)}{dQ^2\,d\nu}. \qquad (10.43)$$

The summation here is over all types of constituents within the proton and the
integral is over the momentum fraction $x$. Thus the complicated hadronic structure is
reduced to the incoherent scattering from point-like constituents times the structure
functions $f_i(x)$.

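The point cross sections have already been derived in Section 8.3. For
neutrino–parton scattering

$$\frac{d\sigma}{dQ^2\,d(p_i \cdot q/m_i)} = \frac{G^2}{\pi}\,\delta\!\left(\frac{p_i \cdot q}{m_i} - \frac{Q^2}{2m_i}\right) \qquad (10.44)$$

or

$$\frac{d\sigma}{dQ^2\,d(p_i \cdot q)} = \frac{G^2}{\pi}\,\delta\!\left(p_i \cdot q - \frac{Q^2}{2}\right). \qquad (10.45)$$

For neutrino–antiparton scattering

$$\frac{d\sigma}{dQ^2\,d(p_i \cdot q)} = \frac{G^2}{\pi}\left(1 - \frac{p_i \cdot q}{p_i \cdot k}\right)^2 \delta\!\left(p_i \cdot q - \frac{Q^2}{2}\right), \qquad (10.46)$$

with $k^\mu$ the four-momentum of the neutrino.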

The proton is built from two up quarks and one down quark, which constitute 
the valence quarks and give the proton its quantum numbers. In addition, there is 
in the proton a cloud of quark—antiquark pairs produced by the radiation of gluons 
and their subsequent conversion into pairs. The number of the pairs is infinite, but 
their momentum distributions have not been calculated explicitly. We denote the 
probability of finding an up quark carrying a fraction x of the proton’s momentum by 
u(x). Similarly, we denote by d(x) the probability of finding a down quark carrying 
a fraction x of the proton’s momentum. The cloud of quark—antiquark pairs of any 
flavor necessitates the introduction of additional quark distribution functions. For 
instance, $\bar u(x)$ and $\bar d(x)$ correspond to up and down antiquarks. Similarly, there are
distributions $s(x)$, $\bar s(x)$, $c(x)$, $\bar c(x)$, ... for strange, charm, and other flavors.

When we substitute $p_i^\mu = xP^\mu$ into (10.44) we obtain the point cross section

$$\frac{d\sigma_i}{dQ^2\,d\nu} = \frac{G^2}{\pi}\,Mx\,\delta\!\left(xM\nu - \frac{Q^2}{2}\right). \qquad (10.47)$$
Finally, on substituting the point cross sections in (10.43) and integrating over $x$,
we arrive at the neutrino–proton scattering cross section

$$\frac{d\sigma^{\nu p}}{dQ^2\,d\nu} = \frac{G^2}{\pi}\,\frac{x}{\nu}\left[d(x) + \bar u(x)\left(1 - \frac{\nu}{E}\right)^2\right]. \qquad (10.48)$$

Here we omit the contribution from the strange and heavier quarks and their
antiparticles. In the case of antineutrino–proton scattering we obtain

$$\frac{d\sigma^{\bar\nu p}}{dQ^2\,d\nu} = \frac{G^2}{\pi}\,\frac{x}{\nu}\left[\bar d(x) + u(x)\left(1 - \frac{\nu}{E}\right)^2\right]. \qquad (10.49)$$

It is now evident that the momentum fraction $x = Q^2/(2M\nu)$ is indeed the Bjorken
scaling variable.
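As a worked step, the $x$ integration that combines the point cross section
$(G^2/\pi)\,Mx\,\delta(xM\nu - Q^2/2)$ with a distribution $f_i(x)$ can be written out
explicitly. The lines below are a sketch that uses only the identity
$\delta(a(x - x_0)) = \delta(x - x_0)/|a|$; the symbol $x_0$ is introduced here for
illustration and $f_i$ stands for any one of the quark distributions.

```latex
\int_0^1 dx\, f_i(x)\,\frac{G^2}{\pi}\,M x\,
   \delta\!\left(xM\nu - \frac{Q^2}{2}\right)
 = \frac{G^2}{\pi}\,\frac{M x_0}{M\nu}\, f_i(x_0)
 = \frac{G^2}{\pi}\,\frac{x_0}{\nu}\, f_i(x_0),
\qquad x_0 = \frac{Q^2}{2M\nu}.
```

The delta function therefore forces the parton momentum fraction to equal
$Q^2/(2M\nu)$. Changing variables with $dx\,dy = dQ^2\,d\nu/(2ME\nu)$, where
$y = \nu/E$, turns the prefactor $(G^2/\pi)(x/\nu)$ into $(G^2/\pi)\,2MEx$.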

The general case with many families of quarks can be easily written down. The
contribution from a quark $q(x)$ and an antiquark $\bar q(x)$ is

$$\frac{d\sigma^{\nu N}}{dx\,dy} = \frac{G^2}{\pi}\,2MEx\left[q(x) + (1 - y)^2\,\bar q(x)\right], \qquad (10.50)$$

provided that the quark under consideration is allowed by charge conservation.
Similarly, the antineutrino–nucleon cross section is

$$\frac{d\sigma^{\bar\nu N}}{dx\,dy} = \frac{G^2}{\pi}\,2MEx\left[(1 - y)^2\,q(x) + \bar q(x)\right]. \qquad (10.51)$$

These relations are used to determine the antiquark content of the proton. For
instance, the antineutrino–nucleon cross section at $y = 1$ measures $\bar q$. The total cross
sections are also easily derived. They grow linearly with neutrino or antineutrino
energy.

Finally the ratio of the total cross sections for an isoscalar target, such as
deuterium or oxygen, is

$$\frac{\sigma^{\bar\nu}}{\sigma^{\nu}} = \frac{\int_0^1 dx\,x\left[\tfrac{1}{3}(u + d) + (\bar u + \bar d)\right]}{\int_0^1 dx\,x\left[(u + d) + \tfrac{1}{3}(\bar u + \bar d)\right]}. \qquad (10.52)$$

Taking the experimental ratio of the cross sections to be approximately 0.50, we
arrive at the conclusion that the integrated antiquark contribution is approximately
20% of the quark contribution.

We close this section with a few remarks. We derived the general formulas
for neutrino- and antineutrino-induced reactions using helicity cross sections. We
have also shown explicitly that they are related to the structure functions. The
formalism can be carried over to electroproduction, for which similar formulas
hold. We also emphasized that high-energy neutrino reactions are closely related
to electroproduction in the deep inelastic region. Both reactions were analyzed in
terms of the parton model, assuming that the constituents of protons are the quarks
(Bjorken and Paschos, 1969). This will be further developed in the next two chapters,
where the quark–parton content of hadrons becomes more evident.

Figure 10.4. The Drell–Yan process.
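The 20% estimate for the antiquark content can be reproduced with a few lines of exact
arithmetic. The sketch below is only an illustration: it assumes the ratio of total
cross sections has the form $(\tfrac{1}{3} + r)/(1 + \tfrac{1}{3}r)$, where $r$ is the
ratio of the momentum-weighted antiquark integral to the quark integral and the factor
$\tfrac{1}{3}$ is $\int_0^1 (1-y)^2\,dy$. The function names are ours.

```python
from fractions import Fraction as F


def ratio(r):
    """sigma(nubar)/sigma(nu) for an antiquark-to-quark momentum ratio r."""
    return (F(1, 3) + r) / (1 + r / 3)


def antiquark_fraction(R):
    """Invert ratio(r) = R, giving r = (R - 1/3) / (1 - R/3)."""
    return (R - F(1, 3)) / (1 - R / 3)


r = antiquark_fraction(F(1, 2))  # experimental ratio taken as 0.50
print(r, float(r))
```

With a measured ratio of one half this gives $r = 1/5$, and with no antiquarks at all
($r = 0$) the ratio falls to its lower bound of $1/3$.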


10.5 The Drell-Yan process 


The production of a massive photon or of a $W$ boson in hadron–hadron collisions and
its subsequent decay has been successfully analyzed in terms of the parton model.
The reactions (Drell and Yan, 1970)

$$p + p \to W + \cdots \to e\nu + X \qquad (10.54)$$


are known as Drell–Yan processes. Together with deep inelastic scattering and
electron–positron annihilation, these processes play an important role in determining
the structure functions and in testing the parton model, including QCD corrections.
The Drell–Yan process was especially important in formulating a strategy for
seeking and discovering the W bosons.

To calculate the cross section corresponding to Fig. 10.4, we begin with the 
parton subprocess, 


$$\sigma(q\bar q \to \ell^+\ell^-) = \frac{4\pi\alpha^2}{3Q^2}\,e_q^2. \qquad (10.55)$$

In order to embed it in the hadronic process, we rewrite it as a differential cross
section, $d\sigma/dQ^2$, for the production of a lepton pair with invariant mass $\sqrt{Q^2}$,
where

$$\hat s = (p_q + p_{\bar q})^2, \qquad (10.56)$$
$$\frac{d\sigma}{dQ^2} = \frac{4\pi\alpha^2}{3Q^2}\,e_q^2\,\delta(Q^2 - \hat s). \qquad (10.57)$$


We envisage each hadron of momentum P being made up of partons carrying 
a longitudinal momentum xP. We make the idealization that the partons carry 
negligible transverse momentum. When the mass of the produced pair is very large, 
the cross section is the incoherent sum of the elementary subprocesses. In this case a
quark of type $q$ from one hadron annihilates with an antiquark of the same type from
the other hadron. The probability of finding the quark with fractional momentum
$x$ is given by $f_q(x)$ and that for the antiquark by $f_{\bar q}(y)$. The hadronic cross section
can now be obtained (Drell and Yan, 1970):

$$\frac{d\sigma}{dQ^2}(pp \to \ell^+\ell^- X) = \overline{\sum_q} \int dx \int dy\, f_q(x)\,f_{\bar q}(y)\,\frac{d\sigma}{dQ^2}(q\bar q \to \ell^+\ell^-), \qquad (10.58)$$

where the sum is over all possible $q\bar q$ pairs that can be formed from the constituents
of the colliding protons and the average (indicated by the bar) is over the number of
initial $q\bar q$ states. This gives in the end an overall factor of $\tfrac{1}{3}$.

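The $q$ and $\bar q$ carry the fractions $x$ and $y$ of the proton momenta and the invariant
mass becomes

$$\hat s = (xp_1 + yp_2)^2 \approx xys, \qquad (10.59)$$

with $s \approx 2p_1 \cdot p_2$. The cross section now takes the form

$$\frac{d\sigma}{dQ^2} = \frac{4\pi\alpha^2}{9Q^2}\sum_q e_q^2 \int dx \int dy\, f_q(x)\,f_{\bar q}(y)\,\delta(Q^2 - xys). \qquad (10.60)$$

After integration over $y$, we obtain the final result

$$\frac{d\sigma}{dQ^2} = \frac{4\pi\alpha^2}{9Q^2 s}\sum_q e_q^2 \int_{Q^2/s}^{1} \frac{dx}{x}\, f_q(x)\,f_{\bar q}\!\left(\frac{Q^2}{xs}\right). \qquad (10.61)$$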


To lowest order (without gluon emission) we expect a scaling result: the last integral
depends on the ratio $\tau = Q^2/s$. The scaling is satisfied but the overall rate is
modified by QCD corrections, which involve gluons. In this case the corrections
are substantial and the reader should consult specialized articles for more details.
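The scaling statement can be made concrete numerically. The sketch below is a minimal
illustration, not a fit to data: it assumes the lowest-order form
$d\sigma/dQ^2 = \frac{4\pi\alpha^2}{9Q^2 s}\,e_q^2 \int_{\tau}^{1}\frac{dx}{x}\,f_q(x)\,f_{\bar q}(\tau/x)$
for a single quark flavor, and it uses two made-up toy distributions `f_q` and `f_qbar`
(hypothetical shapes chosen only to be integrable). Units are natural units.

```python
import math

ALPHA = 1 / 137.036  # fine-structure constant


def f_q(x):
    """Toy quark distribution (hypothetical shape, for illustration only)."""
    return (1 - x) ** 3 / math.sqrt(x)


def f_qbar(x):
    """Toy antiquark distribution (hypothetical shape, for illustration only)."""
    return 0.2 * (1 - x) ** 7 / math.sqrt(x)


def dsigma_dQ2(Q2, s, e_q=2 / 3, n=20000):
    """Lowest-order Drell-Yan cross section for one flavor (midpoint rule)."""
    tau = Q2 / s
    h = (1 - tau) / n
    total = 0.0
    for i in range(n):
        x = tau + (i + 0.5) * h          # midpoint, so tau < x < 1
        total += f_q(x) * f_qbar(tau / x) / x
    integral = total * h
    return 4 * math.pi * ALPHA**2 / (9 * Q2 * s) * e_q**2 * integral


def scaled(Q2, s):
    """Q^4 dsigma/dQ^2, which should depend on Q^2 and s only through tau."""
    return Q2**2 * dsigma_dQ2(Q2, s)


# Same tau = 0.01 at two different energies, then a different tau.
print(scaled(100.0, 1.0e4), scaled(400.0, 4.0e4), scaled(100.0, 4.0e4))
```

The first two numbers agree because they share the same $\tau$, while the third differs.
Gluon emission, which this sketch leaves out, changes the overall rate.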
In addition to the quarks, the proton also contains gluons, i.e. vector bosons
that mediate the strong interactions. This requires that for each hadron we must
introduce a gluon distribution function, $g(x)$. In several processes gluons play an
important role. For instance, the production of Higgs bosons in high-energy colliders
proceeds through the fusion of two gluons,

$$g + g \to H \to ZZ, \qquad (10.62)$$

with $g$ denoting gluons and the Higgs decaying to two lighter particles (in this case
$Z$ bosons). The hadronic reaction can be analyzed as a Drell–Yan process with the
quarks of the intermediate states replaced by gluons.

Let us consider the process 


$$p + p \to H + \text{hadrons} \to ZZ + \text{hadrons} \qquad (10.63)$$


and denote by $\sigma_0(gg \to H \to ZZ)$ the point cross section for the production of two
$Z$ bosons. The gluon distribution function for protons has been measured in DESY
experiments to be large at small values of $x$. The cross section for the production
of $Z$ pairs through two gluons with momenta $xp_1$ and $yp_2$, respectively, is given by

$$\frac{d\sigma_0}{dQ^2} = \sigma_0(Q^2)\,\delta(Q^2 - xys). \qquad (10.64)$$

The cross section for the proton–proton collision is

$$\frac{d\sigma}{dQ^2} = \frac{\sigma_0(Q^2)}{s} \int \frac{dx}{x}\, g(x)\,g\!\left(\frac{Q^2}{xs}\right). \qquad (10.65)$$


One usually takes the gluon structure functions from electron–proton-scattering
experiments and extrapolates them to regions of small $x$ and large $Q^2$ by means
of the renormalization-group equations. In addition to the corrected structure
functions, the calculations must include corrections to the gluon–Higgs-boson coupling
induced again by virtual gluons.


Problems for Chapter 10 


1. Show that Eq. (10.5) is determined up to an overall phase. 
2. Determine the behaviour of ae under rotations around the z-axis. 
This can be done easily if you split ee into e0 = (0, 1, 0, 0) and e9 = (0, 0, 1, 0). 
3. Prove Eq. (10.7) with the following Ansatz: j” = ag’ + bek + ceg. 
Determine $a$, $b$, and $c$ using $k^\mu = (E, k_x, 0, k_z)$, $q^\mu = (\nu, 0, 0, q_z)$, and momentum
conservation. Apply the high-energy limit $\nu \gg 2M$ and $Q^2 \ll \nu^2$ in order to obtain
Eq. (10.7).
4. Derive Eq. (10.22) starting from Eq. (10.19). 


5. In order to prove Eq. (10.30) rewrite $j_\mu^{\rm lept}$ as

$$j_\mu^{\rm lept} = \tfrac{1}{2}\,\bar u'\,\gamma_\mu(1 + \gamma_5 + 1 - \gamma_5)\,u$$

and follow steps similar to those in the case of neutrino–hadron scattering.
6. Check the individual steps leading to Eq. (10.34).
7. Carry out the various steps leading to Eqs. (10.47) and (10.48).
8. Calculate the helicity cross section for a left-handed W scattered on quarks and show
that it reproduces the result in Eq. (8.45).


Charged-current reactions


Charged-current interactions are the most frequent and occur in decays, as well as in 
particle reactions. They have been analyzed in many books, especially those written 
before 1970. Charged-current interactions, especially decays, were instrumental in 
establishing properties of the currents. We can classify them according to the degree 
of our theoretical understanding. The simplest reactions are purely leptonic. They 
are relatively simple to calculate, because the couplings of leptons to currents are 
precisely known and, now that the theory is renormalizable, we can include loop 
corrections. Some leptonic reactions were presented in Chapter 8. We shall not 
study them further. 

The next class of reactions consists of the semileptonic ones, which can also 
be treated successfully with various theoretical methods. They involve a single 
coupling of the currents to hadrons, which can be understood at low energy and/or 
at low momentum transfer in terms of form factors. They are also understood at 
high energies in terms of the short-distance behavior of the currents. We shall 
study several processes in this chapter: deep inelastic scattering and quasi-elastic 
scattering. 

Non-leptonic interactions are the most difficult to analyze. They do not include 
any leptons and involve both strong and weak interactions. The interplay between 
the two interactions is still a developing field of research. 


11.1 Deep inelastic scattering 


High-energy neutrino interactions have been used to probe the inner structure of 
protons and neutrons: these studies were crucial for establishing the quark sub- 
structure of matter and giving quantitative support to the field theory of quark 
interactions (quantum chromodynamics). In Chapter 10 we described the general 
structure of the cross sections and some consequences of the scaling phenomenon. 
Then we showed that the general features can be explained in terms of the
quark—parton model. Many more properties and correlations with other reactions 
have been understood and we discuss them here in greater detail. 


11.1.1 Scaling and the charge of the quarks 


The electroproduction reactions couple to the charge of the quarks, in contrast to 
the neutrino reactions, which couple to the weak isospin. Comparison of the two 
processes gives an indication regarding the charge of the constituents. 

The electroproduction cross section is 


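$$\frac{d\sigma}{dQ^2\,d\nu} = \frac{4\pi\alpha^2}{Q^4} \int dx\,\delta\!\left(\nu - \frac{Q^2}{2Mx}\right) \sum_i e_i^2\left[q_i(x) + \bar q_i(x)\right]$$
$$= \frac{4\pi\alpha^2}{Q^4}\,\frac{x}{\nu} \sum_i e_i^2\left[q_i(x) + \bar q_i(x)\right], \qquad (11.1)$$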


with the normalization chosen to reproduce the Mott cross section. We obtain the
structure function

$$F_2^{ep}(x) = x\left\{\tfrac{4}{9}\left[u(x) + \bar u(x)\right] + \tfrac{1}{9}\left[d(x) + \bar d(x) + s(x) + \bar s(x)\right]\right\}, \qquad (11.2)$$

where $u(x)$, $d(x)$, $\bar u(x)$, and $\bar d(x)$ are the quark distribution functions in the proton.
Similar equations hold for electron–neutron scattering, in which the exchanges
$u \leftrightarrow d$ and $\bar u \leftrightarrow \bar d$ take place. The structure function on an isoscalar target is the
average over protons and neutrons,

$$F_2^{eN}(x) = x\left\{\tfrac{5}{18}\left[u(x) + \bar u(x) + d(x) + \bar d(x)\right] + \tfrac{1}{9}\left[s(x) + \bar s(x)\right]\right\}, \qquad (11.3)$$

where the factor of 5/18 follows directly from the fractional charges of the quarks.
For neutrino-induced reactions the structure functions are expressed in terms of 
the quark distributions 


$$F_2^{\nu}(x) = 2x\left[q_i(x) + \bar q_i(x)\right], \qquad (11.4)$$
$$F_2^{\nu}(x) = 2xF_1^{\nu}(x) \quad \text{(Callan and Gross, 1969)}, \qquad (11.5)$$
$$xF_3^{\nu}(x) = 2x\left[q_i(x) - \bar q_i(x)\right]. \qquad (11.6)$$

For neutrinos the following elementary processes are possible:

$$\nu d \to \mu^- u, \qquad \nu \bar u \to \mu^- \bar d;$$

and for antineutrinos

$$\bar\nu u \to \mu^+ d, \qquad \bar\nu \bar d \to \mu^+ \bar u.$$

We shall assume in this section that the Cabibbo angle is zero, so that scatterings
from strange quarks are neglected. Then we obtain the structure functions


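$$F_2^{\nu p}(x) = 2x\left[d(x) + \bar u(x)\right],$$
$$F_2^{\bar\nu p}(x) = 2x\left[\bar d(x) + u(x)\right] = F_2^{\nu n}(x) \qquad (11.7)$$

and, for isoscalar targets,

$$F_2^{\nu N}(x) = x\left[u(x) + d(x) + \bar u(x) + \bar d(x)\right] \qquad (11.8)$$

and

$$xF_3^{\nu N}(x) = x\left[u(x) + d(x) - \bar u(x) - \bar d(x)\right]. \qquad (11.9)$$

Since the strange-quark structure functions $s(x)$ and $\bar s(x)$ are relatively small, we
can neglect them and obtain from Eqs. (11.3) and (11.8) the ratio

$$\frac{F_2^{eN}(x)}{F_2^{\nu N}(x)} = \frac{5}{18}. \qquad (11.10)$$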
The ratio measures the average charge of the quarks and it indicates that the charges 
of the constituents are fractional. These and other relations have shown that the 
constituents of hadrons which couple in deep inelastic scattering carry the quantum 
numbers of the quarks. 
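The origin of the factor 5/18 is simple arithmetic on the quark charges, which the
following sketch makes explicit. It assumes that strange quarks are neglected and that
the neutrino structure function on an isoscalar target carries unit weight for
$x[u + \bar u + d + \bar d]$; the integer-charge assignment in the second call is a
hypothetical alternative included only for contrast.

```python
from fractions import Fraction as F


def mean_square_charge(e_u, e_d):
    """Weight of x[u + ubar + d + dbar] in F2 for electron scattering on an
    isoscalar target: the average of e_u^2 and e_d^2 over proton and neutron."""
    return (e_u**2 + e_d**2) / 2


fractional = mean_square_charge(F(2, 3), F(-1, 3))  # quark-model charges
integer = mean_square_charge(F(1), F(0))            # hypothetical integer charges
print(fractional, integer)
```

Fractional charges give 5/18, whereas the integer-charge alternative would give 1/2,
so the measured ratio discriminates between the two assignments.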


11.1.2 Spin of the quarks 


In the $V - A$ theory all fermions participate in the weak interactions as left-handed
particles and all antifermions as right-handed particles. For antineutrino–quark
scattering, helicity conservation requires that the process in the center-of-mass
frame vanishes at $\theta_{\rm cm} = 180^\circ$ as shown in Fig. 11.1.

The cross section has the angular dependence

$$\frac{d\sigma^{\bar\nu q}}{d\cos\theta_{\rm cm}} \propto (1 + \cos\theta_{\rm cm})^2. \qquad (11.11)$$

The center-of-mass angle $\theta_{\rm cm}$ is related to the laboratory energies by

$$\frac{1 + \cos\theta_{\rm cm}}{2} = \frac{E_\mu}{E_\nu} = 1 - y, \qquad (11.12)$$
Figure 11.1. Production of a forbidden configuration by helicity conservation in
antineutrino–quark scattering.


Table 11.1. Angular dependences of the reactions

Process                          $J_z$    $y$ dependence
$\nu q$, $\bar\nu \bar q$        0        1
$\nu \bar q$, $\bar\nu q$        1        $(1 - y)^2$

which is easily obtained by evaluating the ratio k’ - p/(k - p) in the center-of-mass 
system. In this pictorial manner we understand the y dependence of the cross 
section: 


$$\frac{d\sigma^{\bar\nu q}}{dy} \propto (1 - y)^2. \qquad (11.13)$$

A similar study of the reaction

$$\nu + d \to \mu^- + u \qquad (11.14)$$

shows that there is no reason for the cross section to vanish in any direction. In this 
way we construct Table 11.1. 

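The cross sections in terms of the various species attain the form with $q(x)$ and
$\bar q(x)$ contributions from spin-$\tfrac{1}{2}$ constituents and $k(x)$ distributions for spin-zero
constituents:

$$\frac{d\sigma^{\nu N}}{dx\,dy} = \frac{G^2ME}{\pi}\,x\left[q(x) + (1 - y)^2\,\bar q(x) + (1 - y)\,k(x)\right], \qquad (11.15)$$
$$\frac{d\sigma^{\bar\nu N}}{dx\,dy} = \frac{G^2ME}{\pi}\,x\left[(1 - y)^2\,q(x) + \bar q(x) + (1 - y)\,k(x)\right]. \qquad (11.16)$$

The experimental results indicate that the scattering occurs on spin-$\tfrac{1}{2}$ constituents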
and that the content of scalar constituents is very small. The y distribution for 
antineutrinos is not exactly zero at y = 1 because protons and neutrons contain a 
sea of quark—antiquark pairs in addition to their valence quarks. These pairs are 
created by the emission of vector particles, the gluons, which also bind the quarks 
into hadrons. 
11.1.3 Sum rules


In the quark–parton model, the distribution functions indicate how the quantum
numbers are distributed within hadrons. Thus integrals of distribution functions
must reproduce the quantum numbers of the target. For instance, $\int_0^1 s(x)\,dx$ gives
the probability of finding a strange quark, with any momentum, within a proton.
Since the proton has zero strangeness,

$$\int_0^1 \left[s(x) - \bar s(x)\right]dx = 0. \qquad (11.17)$$

Similarly we can compute the baryon number and isospin of a proton:

$$\frac{1}{3}\int_0^1 \left[u + d - \bar u - \bar d\right]dx = 1 \quad \text{(baryon)}, \qquad (11.18)$$
$$\int_0^1 \left[(u - d) - (\bar u - \bar d)\right]dx = 1 \quad \text{(isospin)}. \qquad (11.19)$$


Such relations are known as sum rules and have been determined by combining 
data from various processes. 
A good example is the Adler (1965) sum rule 


$$S_A = \frac{1}{2}\int_0^1 \frac{dx}{x}\left[F_2^{\bar\nu p}(x) - F_2^{\nu p}(x)\right] = 1, \qquad (11.20)$$

which follows from the isospin relation in (11.19). The Adler sum rule follows from
current algebra and it must be valid for each value of $Q^2$. In fact, it is a consequence
of the commutator of two isospin charges and as such is very reliable. Experimental
results give the value

$$S_A = 1.08 \pm 0.20, \qquad (11.21)$$


which is in good agreement but the error is relatively large. 
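To see how the counting works in practice, the sum rules can be evaluated for a set of
toy distributions. Everything in the sketch below is an assumption made for
illustration: the shapes `u`, `d` and `sea` are hypothetical polynomials, normalized so
that the proton contains two valence up quarks and one valence down quark, and the
structure functions are taken as $F_2^{\nu p} = 2x[d + \bar u]$ and
$F_2^{\bar\nu p} = 2x[\bar d + u]$.

```python
def integrate(f, n=20000):
    """Midpoint rule on [0, 1]; the endpoints are never evaluated."""
    h = 1.0 / n
    return sum(f((i + 0.5) * h) for i in range(n)) * h


def sea(x):
    """Toy sea distribution, taken equal for ubar and dbar (hypothetical)."""
    return 0.5 * (1 - x) ** 7


def u(x):
    """Toy up distribution: valence part integrates to 2 (hypothetical)."""
    return 8 * (1 - x) ** 3 + sea(x)


def d(x):
    """Toy down distribution: valence part integrates to 1 (hypothetical)."""
    return 4 * (1 - x) ** 3 + sea(x)


ubar = dbar = sea


def F2_nu_p(x):
    return 2 * x * (d(x) + ubar(x))


def F2_nubar_p(x):
    return 2 * x * (dbar(x) + u(x))


baryon = integrate(lambda x: (u(x) + d(x) - ubar(x) - dbar(x)) / 3)
isospin = integrate(lambda x: (u(x) - d(x)) - (ubar(x) - dbar(x)))
adler = integrate(lambda x: 0.5 * (F2_nubar_p(x) - F2_nu_p(x)) / x)
print(baryon, isospin, adler)
```

All three integrals come out equal to 1 within the numerical accuracy. The sea drops out
of each difference, which is why the Adler sum rule is insensitive to the poorly known
antiquark content.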
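A rapidly converging sum rule is the Gross–Llewellyn Smith (1969) sum rule

$$\int_0^1 \left[F_3^{\nu p}(x) + F_3^{\bar\nu p}(x)\right]dx = 6\left(1 - \frac{\alpha_s}{\pi}\right)$$
$$= 5.4 \quad \text{at } Q^2 = 3\ {\rm GeV}^2$$
$$= 5.00 \pm 0.16 \quad \text{(experiment)}. \qquad (11.22)$$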


The right-hand side includes first-order QCD corrections. Again the agreement is 
very good. 
Finally, the integral $\int_0^1 x\,q(x)\,dx$ gives the fraction of the proton's momentum
carried by the $q$ quark. Thus

$$\int_0^1 x\left[u(x) + d(x) + s(x) + \bar u(x) + \bar d(x) + \bar s(x)\right]dx = 0.54 \qquad (11.23)$$

is the fraction of the momentum carried by all the quarks inside the proton. This
integral was determined by combining data from several processes. It is much less than
unity, indicating that the quarks carry about one half of the proton's momentum. The remaining
half must be carried by other particles, which do not interact directly with the 
currents. They are the gluons of quantum chromodynamics. 


11.2 Evolution of distribution functions 


The algebraic relations discussed in this and the previous chapter assumed point- 
like constituents within the nucleon. We know from previous advances in physics 
that a particle that looks point-like on one resolution scale reveals substructure at 
a higher resolution. The scaling phenomenon and the numerous quantum-number 
relations revealed a point-like structure, but deviations from scaling indicate the 
existence of additional structure. In fact, it has been established that the variation 
of the structure functions with Q? is due to the emission of vector particles: the 
gluons. As was mentioned earlier, they carry the other half of the momentum of the 
nucleons, that was missing in the momentum sum rule. 

The additional structure is introduced by the theory of strong interactions known 
as quantum chromodynamics (QCD). There are many indications that each flavor 
of quarks comes in three colors: red, white, and blue. The names for the colors 
are arbitrary, but the fact that there are three is important. The quarks interact 
with each other by the exchange of vector bosons that change the color of the 
quarks. 

The theory of the strong interaction, QCD, is a non-Abelian gauge theory
based on the group SU(3)$_c$. Each quark species (up, down, strange, ...) forms a
triplet in color space and has a coupling $g_s$ to the eight gluons, $G^a_\mu$, which belong
to the adjoint representation of SU(3) color. We write the color triplet as

$$q(x) = \begin{pmatrix} q_r \\ q_w \\ q_b \end{pmatrix}$$

and the Lagrangian

$$\mathcal{L}_{\rm QCD} = \bar q(x)\,i\gamma^\mu\left[\partial_\mu + i g_s\,\frac{\lambda^a}{2}\,G^a_\mu(x)\right]q(x), \qquad (11.24)$$

with $\lambda^a$ the Gell-Mann matrices of SU(3)$_c$ and $G^a_\mu$ with $a = 1, 2, \ldots, 8$ the
eight gluons. There is no mass term for the gluons, which leaves the color symmetry
exact. QCD makes dramatic predictions. The first one concerns the coupling
constant.

In field theories coupling constants and other observables are modified by higher- 
order corrections that involve loops. Many loop diagrams are divergent, which 
demands special handling of them. When all the infinities from loop diagrams 
are absorbed into the definition of couplings, masses, and other parameters of the 
original Lagrangian, we say that the theory is renormalizable. In these theories we 
can calculate physical observables with high precision. QCD and the electroweak 
theory are renormalizable. It is beyond the scope of this book to describe or prove 
renormalization. Instead, we shall describe a few cases of higher-order corrections 
in order to demonstrate the methods entering these calculations. Furthermore, we 
describe some properties of field theories that have significant impact on properties 
of weak interactions. 

One quantity modified by loop corrections is the strong coupling constant $g_s$.
The infinities introduced by higher orders are absorbed into the redefinition of
the coupling constant. Since the corrections involve the addition of infinite
quantities, the numerical value of the coupling is unknown and must be determined
experimentally. Thus $\alpha_s$ is measured at a specific reference scale $\mu_0$, known as
the renormalization point. In many cases the reference scale $\mu$ is identified with
the momentum flowing through the vertex. The arbitrariness of the reference point
leads to a differential equation, the renormalization-group equation. To be specific,
the change of $\alpha(\mu) = g_s^2/(4\pi)$ with respect to the reference point $\mu$ satisfies the
equation

$$\mu^2\,\frac{d\alpha}{d\mu^2} = \beta(\alpha), \qquad (11.25)$$

where $\beta(\alpha)$ represents the sum of higher-order corrections and is of the form

$$\beta(\alpha) = -\beta_0\,\alpha^2 + O(\alpha^3). \qquad (11.26)$$

The constant $\beta_0$ and higher terms are determined in perturbation theory. The solution
is obtained as

$$\int_{\alpha(\mu_0)}^{\alpha(\mu)} \frac{d\alpha}{\beta(\alpha)} = \ln\!\left(\frac{\mu^2}{\mu_0^2}\right) \qquad (11.27)$$

or, keeping the leading term on the right-hand side of Eq. (11.26), we obtain

$$\alpha(\mu) = \frac{\alpha(\mu_0)}{1 + \beta_0\,\alpha(\mu_0)\,\ln(\mu^2/\mu_0^2)}. \qquad (11.28)$$
This states that, knowing the coupling constant at the reference scale $\mu_0$, we can
predict its value at another scale $\mu$. The coupling constant is no longer a constant
but runs with momentum; hence the name running coupling constant.

An important property of QCD is that $\beta_0$, which is proportional to $11 - \tfrac{2}{3}N_f$ with
$N_f$ the number of quark flavors, is positive. As the momentum increases, the coupling
constant decreases and, at very high momentum, $\alpha_s(\mu)$ is so small that perturbation
theory is applicable. This provides a justification of scaling and of the parton model.
It also goes beyond scaling, by predicting modifications introduced by the emission 
of gluons. The corrections are functions of Q? producing predictable violations of 
scaling. The corrections have been studied extensively in perturbation theory and 
compared with many experimental results. 

There is an extensive list of articles in which violations of scaling have been
computed and are discussed in detail. Experimentally, the changes have been observed
with the structure functions increasing for small $x$ as functions of $Q^2$ and
decreasing for $x > 0.4$.

The second prediction concerns the production of gluons, which are emitted
by the accelerating particles. The quarks produced materialize into hadrons
and produce jets of particles. Similarly, the gluons also produce jets of hadrons.
Consequently we expect some reactions to produce two jets (from $q\bar q$ pairs) and
others three jets (from $q\bar q g$ production). Three-jet events have been observed in
electron–positron-annihilation reactions. The production of gluons implies that
they also exist within hadrons and are responsible for the missing momentum 
in the sum rule in Eq. (11.23). Consequently the experimental results have been 
analyzed with the inclusion of an additional distribution function for the gluons. 
A dramatic property of the gluon distribution function is its rapid increase at 
small x. 

For small momenta the coupling constant grows and becomes very big, making
a perturbative description impossible. It is customary to denote by $\Lambda^2$ the scale of
$Q^2$ at which the denominator becomes zero. This happens at

$$\Lambda^2 = \mu_0^2\,e^{-1/[\beta_0\,\alpha(\mu_0)]}. \qquad (11.29)$$

It follows now that the coupling constant can be rewritten

$$\alpha(\mu) = \frac{1}{\beta_0\,\ln(\mu^2/\Lambda^2)}. \qquad (11.30)$$
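The running of the coupling is easy to explore numerically. The sketch below is
illustrative and rests on stated assumptions: it uses the one-loop forms
$\alpha(\mu) = \alpha(\mu_0)/[1 + \beta_0\alpha(\mu_0)\ln(\mu^2/\mu_0^2)]$ and
$\alpha(\mu) = 1/[\beta_0\ln(\mu^2/\Lambda^2)]$, takes the normalization
$\beta_0 = (11 - \tfrac{2}{3}N_f)/(4\pi)$, and picks $\alpha = 0.118$ at
$\mu_0 = 91.19$ GeV with $N_f = 5$ as sample inputs. The function names are ours.

```python
import math


def beta0(n_f):
    """One-loop coefficient; assumed normalization (11 - 2 n_f / 3) / (4 pi)."""
    return (11 - 2 * n_f / 3) / (4 * math.pi)


def alpha_running(mu, mu0, alpha0, n_f):
    """Coupling at scale mu from its value alpha0 at the reference scale mu0."""
    return alpha0 / (1 + beta0(n_f) * alpha0 * math.log(mu**2 / mu0**2))


def lambda_qcd(mu0, alpha0, n_f):
    """Scale Lambda at which the denominator of alpha_running vanishes."""
    return mu0 * math.exp(-1 / (2 * beta0(n_f) * alpha0))


def alpha_from_lambda(mu, lam, n_f):
    """Equivalent form of the coupling written in terms of Lambda."""
    return 1 / (beta0(n_f) * math.log(mu**2 / lam**2))


# Sample inputs (illustrative): mu0 in GeV, alpha0 at mu0, number of flavors.
MU0, ALPHA0, NF = 91.19, 0.118, 5
LAM = lambda_qcd(MU0, ALPHA0, NF)
for mu in (5.0, 10.0, 91.19, 500.0):
    print(mu, alpha_running(mu, MU0, ALPHA0, NF), alpha_from_lambda(mu, LAM, NF))
```

The two columns agree at every scale, and the coupling grows as $\mu$ decreases. Higher
orders and quark-mass thresholds, which a realistic determination needs, are ignored here.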


We can think of $\Lambda$ as the boundary between the region where quarks and gluons
appear as quasi-free particles and the world of bound states like protons, pions,
etc. At momenta smaller than $\Lambda$, the strong interaction becomes so strong that the
quarks cannot come out as free particles, but remain confined within hadrons. This
property of confined quarks has not been proved yet, and thus it is difficult to judge
which among several approaches may ultimately be the most productive.

It is evident that QCD and its consequences form an extensive and exciting topic, 
which is, however, beyond the scope of this book. We shall have occasion to return 
to QCD in Section 15.6, where we discuss the effective Hamiltonian for low-energy 
weak interactions. As a last topic concerning the charged-current interactions we 
discuss in the next section quasi-elastic scattering. 


11.3 Quasi-elastic scattering 


In contrast to deep inelastic scattering, quasi-elastic scattering gives information 
on the static properties of the proton and the neutron. In fact, the first experiments 
with neutrino beams measured the reactions shown in Fig. 11.2: 


$$\nu(k) + n(p) \to \mu^-(k') + p(p'), \qquad (11.31)$$
$$\bar\nu(k) + p(p) \to \mu^+(k') + n(p'), \qquad (11.32)$$


which are still interesting on several accounts. For instance, we would like to
determine their form factors accurately and check their relation to the electromagnetic
form factors. Furthermore, the quasi-elastic cross sections reach constant values for
neutrino energies greater than 2.0 GeV. This property has been used for measuring
the flux of neutrino beams and is still useful. Low-energy neutrino interactions
recently started being used efficiently for studying neutrino oscillations. For all these
reasons we present in this chapter an explicit calculation.

For low energies relative to the mass of the W boson, interactions of neutrinos 
can be written as a leptonic current times a hadronic current: 


$$\mathcal{M} = \frac{G}{\sqrt{2}}\,\bar u(k')\,\gamma^\mu(1 - \gamma_5)\,u(k)\,\langle p|\,J_\mu^+\,|n\rangle.$$


The hadronic current has a complicated structure produced by the motion of the 
quarks within hadrons. We define the vector form factors of the charged current as 


= / .o, v $ 
(p| Vt |n) = u(p (mrt + ie Ey + i FS)u(p) (11.33) 


Denoting, as before, the isovector form factor of the electromagnetic current by 
F N , we obtain 


Fi(@?) = -2F\(q’). (11.34) 
The factor 2 comes from the normalization of Vi = Vi +iV? and the 


Clebsch—Gordan coefficients, which are we for the charged current and -/3 for 


Figure 11.2. Quasi-elastic scattering. 


the electromagnetic current. For the following calculation it is convenient to write 
the hadronic current in a modified form: 


(p+ ath 


(oltm =a) evra + FES aa 


Gs 


+ ays + pie -F haz 


r y (p), (11.35) 


where we left out the factor $\cos\theta_c$ arising from the Cabibbo angle. In reducing
(11.33) to (11.35) we used the Gordon decomposition formula, which gives the
relations $g_V = F_1^V + F_2^V$ and $f_V = -F_2^V$, and $F_3^V$ vanishes as explained in Chapter 2.
The term $h_A q_\mu \gamma_5$ contributes to the cross section terms proportional to the lepton
masses and will be omitted. In Chapters 1 and 2 we discussed the fact that charge
conjugation and time-reversal together require $h_V$ and $f_A$ to vanish. On eliminating
these three form factors, we obtain a simplified form for the matrix element which 
we shall use in this section: 


(P+ Pu 


(p\ J; In) = wip] evr N= 


A savur uw) (11.36) 
The electromagnetic form factors are known from electron-scattering experiments
on protons and neutrons. Similar values of the axial form factor at low values of
$Q^2$ have been measured in $\beta$-decay. We shall use this information at the end of this
section.

The calculation of the cross section is now straightforward but tedious. Since 
the calculation of quasi-elastic scattering is not easily available in books, I give a 
few intermediate steps. There is a second reason: the elastic scattering for neutral 
currents has a similar functional form that is obtained by replacing the form factors 
by those of neutral currents. Studies of quasi-elastic scattering frequently use a 
formula in terms of Mandelstam variables (Llewellyn Smith, 1974). We derive 
here two more formulas that are convenient for taking limits in specific kinematic 
regions. 
The kinematics for the process are simplest in the laboratory frame where the
nucleon is at rest: 


p-k=p- k= 
EES q? 
DESP REME gs 
2 2 
m q 
eS = 11.37 
5 5 ( ) 
prta 
x es 


$$Q^2 = 2M\nu = 4EE'\sin^2\left(\frac{\theta}{2}\right).$$
The square of the hadronic tensor is obtained from (11.36) 


Huv T (gy + £4) (PuPy oF PvP, a 8&uvP j p = 2igv8gaEuvys p” p° 


2P: P +M 


2 
+ M’ (g3 -ewt R m 


p fusv)(Pp + Pa) + Pe 
(11.38) 


The first line of this equation follows from Eq. (8.37) and the remaining ones from 
a straightforward calculation. The similarities between Section 8.3 and the present 
one can be used for comparisons. For instance, the inner product of the leptonic 
tensor in Eq. (8.36) with $H_{\mu\nu}$ leads to a matrix element that can be expressed in
terms of the scattering angle. From the matrix element and the phase-space integral 
we arrive at the differential cross section 


do G} EF' Q? 6 [0 
qa a z | (z gy +e) + fa s(5) | + GA = edsin ( 2) 


, 6 2 0 
= 2erev(= )sin2($) + ja(u+ E) + 2fvgv leos? (3)} 


(11.39) 


Several limiting cases are now interesting. For $g_A = 0$ the cross section depends
only on the vector terms and the functional form agrees with the Rosenbluth formula. 
Differences between electroproduction and neutrino-induced formulas arise from 
the photon propagator and the coupling constants. 
Alternatively, we may combine the $g_V$ and $g_A$ terms and obtain another expression
for the cross section:


do GM E'N? Mv 
= MN ee = a0) + (gv + ga) (=) + (84 — 8v) Sr 


dE’ 4r 
Q? E' 2 Q? Q? 
ela) +2000 (14 =) 2 (142) l 


(11.40) 


For $f_V = 0$ the interaction of the neutrino has the same functional form as
neutrino–electron scattering and the expression above agrees with Eq. (8.43).

The two equations for quasi-elastic scattering presented already are convenient
for taking specific limits. It is customary, however, to use another formula, which
expresses the differential cross section in terms of the Mandelstam variables $s$, $u$,
and $t = q^2$ (Llewellyn Smith, 1974). Most recent analyses use it and have been able to
account for the experimental data in terms of three form factors. The vector form
factors are related to those measured in electromagnetic reactions. The axial form
factor is parametrized as

$$g_A(q^2) = \frac{g_A(0)}{(1 - q^2/M_A^2)^2} \tag{11.41}$$

with $g_A(0) = 1.26$ and $M_A = 1.00 \pm 0.05$ GeV/$c^2$ (its precise value is still being
debated among the experts).
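
As a quick numerical illustration of the parametrization in Eq. (11.41), the sketch below evaluates the axial form factor at a few momentum transfers. It assumes the dipole form $g_A(q^2) = g_A(0)/(1 - q^2/M_A^2)^2$ with spacelike $q^2 = -Q^2 < 0$; the function name `axial_form_factor` and its argument names are illustrative choices, not notation from the text.

```python
def axial_form_factor(q2, g_a0=1.26, m_a=1.00):
    """Dipole axial form factor (assumed form of Eq. 11.41).

    q2   : four-momentum transfer squared in GeV^2 (negative for scattering)
    g_a0 : value at q2 = 0, as quoted in the text
    m_a  : axial mass in GeV, central value quoted in the text
    """
    return g_a0 / (1.0 - q2 / m_a**2) ** 2


if __name__ == "__main__":
    # Q^2 = -q^2 in GeV^2; the sample points are arbitrary.
    for big_q2 in (0.0, 0.5, 1.0, 2.0):
        value = axial_form_factor(-big_q2)
        print(f"Q^2 = {big_q2:.1f} GeV^2 -> g_A = {value:.4f}")
```

The form factor falls monotonically with $Q^2$, so shifting $M_A$ within its quoted uncertainty mainly changes the high-$Q^2$ tail of the cross section.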
Neutral currents in semileptonic reactions


12.1 Neutrino—hadron neutral-current interactions 


The first experimental support for the electroweak theory came from the observation 
of neutral currents in semileptonic reactions. Neutral currents appear because the 
product SU(2) x U(1) contains two neutral generators. We have shown that one 
linear superposition of generators is the electromagnetic current and the second is a 
neutral current. In Chapter 8 we discussed leptonic neutral-current reactions. In this 
chapter we deal with the observation of neutral currents in semileptonic reactions 
and, in particular, neutrino—hadron interactions. 

The coupling of the $Z$ boson to leptons was given in Eq. (8.11) and that to
quarks in Eq. (9.20). The neutral-current neutrino-hadron interactions have the 
general form 


3 
Hett = [dy "(1 — ys)v] ` (qi T3Yuqi — sin?’ðw ĝi Q yuqi) 
j=l 

3 


$$= \frac{G}{\sqrt{2}}\,[\bar\nu\gamma^\mu(1-\gamma_5)\nu]\,(xV_\mu^3 + yA_\mu^3 + \gamma V_\mu^0 + \delta A_\mu^0), \tag{12.1}$$

where $V_\mu^3$ and $A_\mu^3$ are the isospin partners of the charged currents. $V_\mu^0$ and $A_\mu^0$ are
isoscalar currents for which there are several possibilities. For comparison we give
the normalization of $V_\mu^3$ in terms of quarks:

$$V_\mu^3 = \tfrac12\,(\bar u\gamma_\mu u - \bar d\gamma_\mu d), \qquad
A_\mu^3 = \tfrac12\,(\bar u\gamma_\mu\gamma_5 u - \bar d\gamma_\mu\gamma_5 d). \tag{12.2}$$


We could have defined $V_\mu^3$ abstractly, in terms of its isospin transformation
properties, but, now that quarks permeate our daily language, this notation is appropriate.
The interested reader can always revert to the transformation properties. Similarly,
we define the isoscalar currents

$$V_\mu^0 = \tfrac12\,(\bar u\gamma_\mu u + \bar d\gamma_\mu d) + \cdots, \qquad
A_\mu^0 = \tfrac12\,(\bar u\gamma_\mu\gamma_5 u + \bar d\gamma_\mu\gamma_5 d) + \cdots, \tag{12.3}$$

where the ellipses involve $\bar s s$ and $\bar c c$ terms. With this normalization, the isoscalar piece of
the electromagnetic current is $\tfrac13 V_\mu^0$. In the electroweak theory


$$x = 1 - 2\sin^2\theta_W, \qquad y = -1, \qquad
\gamma = -\tfrac23\sin^2\theta_W, \qquad \delta = 0. \tag{12.4}$$

The vanishing of $\delta$ is a specific property of the standard model when we consider
only up and down quarks. It is non-zero as soon as strange and heavier quarks or
higher-order corrections are introduced.

The weak mixing angle $\theta_W$ is the same angle as that introduced in the leptonic
sector. The first issue was the existence of the neutral currents. This was a difficult
experimental problem because neutral-current interactions were new and the
experiments had a large neutron background.

After the discovery of neutral currents, there was still interest in establishing that 
they belonged to the standard model. As problems, there remained 


(i) verification of the Lorentz structure of neutral currents as vector and axial-vector oper- 
ators, and 

(ii) verification of the internal symmetry structure as a superposition of isovector and
isoscalar operators in terms of a mixing parameter: $\sin^2\theta_W$.


In analyzing these issues there are two separate kinematic regions where we know
the hadronic matrix elements of the currents. One region is deep inelastic scattering,
where the structure functions have been measured and have been explained
successfully in terms of quark-parton distribution functions. The other region
involves low-energy experiments, for which form factors for elastic scattering and
the excitation of the $\Delta(1232)$ resonance are already known. In the next few sections
we study reactions that allow us to decipher the couplings of neutral currents to
hadrons.

When the standard model became popular, it appeared very important to discover
neutral currents. It was also fortunate that experiments with the capability
of searching for them were running or were beginning to run. It was not clear,
however, how large neutral-current cross sections should be. There was a need for
theoretical predictions. At that time the quark-parton model was in its infancy and
its predictions were frequently questioned.

Thus theoretical predictions were carried out at two levels. One approach was
through the symmetry properties of the currents relating $V_\mu^3$ and $V_\mu^0$ to the charged
and electromagnetic currents. The other approach was to calculate cross sections
in the quark-parton model. Nowadays we know that both approaches are
correct.


12.2 Model-independent predictions 


The simplest processes to consider are those involving total cross sections on isospin 
neutral targets. We define 


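$$\sigma_{CC} = \tfrac12\,[\sigma(\nu + p \to \mu^- + X_1) + \sigma(\nu + n \to \mu^- + X_2)]$$
and
$$\sigma_{NC} = \tfrac12\,[\sigma(\nu + p \to \nu + X_3) + \sigma(\nu + n \to \nu + X_4)]. \tag{12.5}$$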


An incoherent sum over all possible final states that yields an isoscalar final state 
is assumed. For the charged-current cross section we write 


$$\sigma_{CC} = V + A + I, \tag{12.6}$$


where V comes from the vector current alone, A from the axial current alone, and 
I is the interference term. We can represent them as follows: 


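$$V = \sum |\langle X|\epsilon\cdot V|p\rangle|^2, \qquad A = \sum |\langle X|\epsilon\cdot A|p\rangle|^2,$$

and

$$I = 2\sum \mathrm{Re}\big(\langle X|\epsilon\cdot V|p\rangle^{*}\,\langle X|\epsilon\cdot A|p\rangle\big), \tag{12.7}$$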


with the sums running over all final states and polarizations of the W boson. In 
Eqs. (12.6) and (12.7) an average over protons and neutrons is understood. 

The vector currents are isovector quantities that are related to the isovector part 
of the neutral current through an isospin rotation. The neutral current contains in 
addition an isoscalar term, but, since we consider isoscalar target and isoscalar final 
states, the isoscalar—isovector interference drops out. It follows now that 


$$\sigma_{NC} = \tfrac12\,(x^2V + xI + A + \gamma^2S), \tag{12.8}$$

where $S$ is the contribution of the isoscalar current. The overall factor of 1/2 follows
from the fact that the charged current transforms like the generator $\sqrt2\,\tau^{\pm}$ of SU(2)
and the neutral current like $\tau^3$. Since $\gamma^2 > 0$,

$$R \equiv \frac{\sigma_{NC}}{\sigma_{CC}} \ge \frac12\,\frac{A + xI + x^2V}{A + I + V}. \tag{12.9}$$


Furthermore, Schwarz's inequality implies
$$4AV \ge I^2. \tag{12.10}$$


On combining the two inequalities (see Problem 1), we arrive at 


fath 1 a HT 12.11 
P re 


The term V can be deduced from knowledge of the isovector contribution to the 
electroproduction cross section 


$$\sigma_{em} = \tfrac12\,[\sigma(e + p \to e + X_1) + \sigma(e + n \to e + X_2)]. \tag{12.12}$$


Not knowing the isoscalar contribution, we use again inequalities, 


G 4 
v< T ia ae: (12.13) 


which gives the final result 


142 
1 Vem \? 
R> if - 2sin?6y (“*") | f (12.14) 
(0 


This derivation makes judicious use of inequalities. Within the electroweak theory
the method is model-independent and holds for many physical processes. When we
plot $R$ versus $\sin^2\theta_W$ there is a minimum for the ratio; for similar inequalities see
Pais and Treiman (1972).

One may also use reactions induced by antineutrinos to obtain additional relations.
On going over to antineutrinos one must change the sign of the interference
term $I$. The charged- and neutral-current cross sections on isoscalar targets are,
respectively,

$$\sigma_{CC}^{\bar\nu} = V + A - I, \tag{12.15}$$
$$\sigma_{NC}^{\bar\nu} = \tfrac12\,(A + x^2V - xI + \gamma^2S). \tag{12.16}$$
On combining Eqs. (12.6), (12.8), (12.15), and (12.16), we obtain (Paschos and
Wolfenstein, 1973)

$$R_- = \frac{\sigma_{NC}^{\nu} - \sigma_{NC}^{\bar\nu}}{\sigma_{CC}^{\nu} - \sigma_{CC}^{\bar\nu}}
= \frac12\,(1 - 2\sin^2\theta_W), \tag{12.17}$$

$$R_+ = \frac{\sigma_{NC}^{\nu} + \sigma_{NC}^{\bar\nu}}{\sigma_{CC}^{\nu} + \sigma_{CC}^{\bar\nu}}
= \frac12 - \sin^2\theta_W + \frac{10}{9}\sin^4\theta_W. \tag{12.18}$$

These relations are truly independent of any details of scaling violations and
eliminate some theoretical corrections inherent in the quark-parton method. They are
frequently used to determine the mixing angle $\sin^2\theta_W$.

In the above derivations we set the parameter 


$$\rho = \frac{M_W^2}{M_Z^2\cos^2\theta_W} \tag{12.19}$$

equal to unity. This is the lowest-order value, which appears also in Eq. (8.19),
but radiative corrections will modify it. Extensive analyses of the data including 
radiative corrections from the top quark and the Higgs meson gave the value 


p = 0.9998 10:0055 and My < 1002 GeV, 


which is indeed very close to unity. This is a confirmation of the SU(2) structure of
the theory and we will continue giving $\rho$ the value unity.
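
The cancellation behind Eq. (12.17) can be checked numerically. The sketch below assumes the decompositions $\sigma_{CC}^{\nu,\bar\nu} = V + A \pm I$ and $\sigma_{NC}^{\nu,\bar\nu} = \tfrac12(x^2V \pm xI + A + \gamma^2 S)$ with $x = 1 - 2\sin^2\theta_W$ and $\gamma = -\tfrac23\sin^2\theta_W$. The numerical values of `V`, `A`, `I`, and `S` are arbitrary placeholders, not data, and the function names are illustrative.

```python
def cross_sections(V, A, I, S, sin2_theta_w):
    """Return (cc_nu, cc_nubar, nc_nu, nc_nubar) on an isoscalar target.

    V, A, I, S are the vector, axial, interference and isoscalar pieces
    in arbitrary common units; only ratios are meaningful here.
    """
    x = 1.0 - 2.0 * sin2_theta_w        # isovector vector coupling
    gamma = -2.0 / 3.0 * sin2_theta_w   # isoscalar vector coupling
    cc_nu = V + A + I
    cc_nubar = V + A - I                # interference flips sign
    nc_nu = 0.5 * (x * x * V + x * I + A + gamma**2 * S)
    nc_nubar = 0.5 * (x * x * V - x * I + A + gamma**2 * S)
    return cc_nu, cc_nubar, nc_nu, nc_nubar


def r_minus(V, A, I, S, sin2_theta_w):
    """Ratio of neutral- to charged-current nu/nubar differences."""
    cc_nu, cc_nubar, nc_nu, nc_nubar = cross_sections(V, A, I, S, sin2_theta_w)
    return (nc_nu - nc_nubar) / (cc_nu - cc_nubar)


if __name__ == "__main__":
    # Two unrelated sets of placeholder inputs give the same ratio.
    print(r_minus(1.0, 0.8, 0.9, 0.3, 0.23))
    print(r_minus(2.0, 1.5, -1.1, 5.0, 0.23))
    print(0.5 * (1.0 - 2.0 * 0.23))
```

Because $V$, $A$, and $S$ drop out of the difference, the ratio depends only on $x$, which is why the relation is insensitive to the hadronic details.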


12.3 Neutral-current cross sections 


It is perhaps more transparent to discuss the various cross sections in the parton 
model (Sehgal, 1973; Kim et al., 1981). The effective Lagrangian density was 
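written, at the beginning of this chapter, in terms of the first generation of quarks.
We re-express the effective interaction in terms of chiral couplings,

$$\mathcal{L} = \frac{G}{\sqrt2}\,\bar\nu\gamma_\mu(1-\gamma_5)\nu\,
\big\{\bar u\gamma^\mu[u_L(1-\gamma_5) + u_R(1+\gamma_5)]u
+ \bar d\gamma^\mu[d_L(1-\gamma_5) + d_R(1+\gamma_5)]d + \cdots\big\}, \tag{12.20}$$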


with $u_L$ and $u_R$ the couplings of the left- and right-handed up quarks and with
a similar definition for $d_L$ and $d_R$. The ellipses indicate again contributions from
higher generations. We adopted this notation because it is convenient to write down
the elementary cross sections as they were classified in Section 8.3 in terms of the
chiralities of the leptonic and hadronic vertices. The new couplings are related to
those defined at the beginning of this chapter as follows:

$$\begin{aligned}
u_L &= \tfrac14\,(x + \gamma - y - \delta), \\
u_R &= \tfrac14\,(x + \gamma + y + \delta), \\
d_L &= \tfrac14\,(-x + \gamma + y - \delta), \\
d_R &= \tfrac14\,(-x + \gamma - y + \delta).
\end{aligned} \tag{12.21}$$

The neutrino experiments determined combinations of $u_L, \ldots, d_R$, which then were
translated into $x$, $y$, $\gamma$, and $\delta$, thus testing the isospin and parity content of the
current. Finally, they were all determined in terms of a single mixing angle $\sin^2\theta_W$.
The expressions become rather long and it is convenient to introduce a shorter
notation. We denote generically by $f_q$ and $f_{\bar q}$ the parton distribution functions for
$q$ and $\bar q$ and their left-handed or right-handed couplings by $q_L$ and $q_R$, respectively.
One easily finds cross sections for the elementary processes


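$$\begin{aligned}
\frac{d\sigma_{NC}(\nu q)}{dx\,dy} &= \frac{2G^2ME}{\pi}\,x f_q(x)\,[q_L^2 + q_R^2(1-y)^2], \\
\frac{d\sigma_{NC}(\nu \bar q)}{dx\,dy} &= \frac{2G^2ME}{\pi}\,x f_{\bar q}(x)\,[q_R^2 + q_L^2(1-y)^2], \\
\frac{d\sigma_{NC}(\bar\nu q)}{dx\,dy} &= \frac{2G^2ME}{\pi}\,x f_q(x)\,[q_R^2 + q_L^2(1-y)^2], \\
\frac{d\sigma_{NC}(\bar\nu \bar q)}{dx\,dy} &= \frac{2G^2ME}{\pi}\,x f_{\bar q}(x)\,[q_L^2 + q_R^2(1-y)^2].
\end{aligned} \tag{12.22}$$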


There are various ways to combine these cross sections and isolate the coupling 
constants. In experiments with isoscalar targets, 


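$$f_u(x) = f_d(x) = f(x) \quad\text{and}\quad f_{\bar u}(x) = f_{\bar d}(x) = \bar f(x).$$

Furthermore, we can integrate over $y$ and set $K = 2G^2ME/\pi$ to obtain

$$\begin{aligned}
\frac{d\sigma_{NC}(\nu N)}{dx} &= Kx\Big[\big(f + \tfrac13\bar f\big)(u_L^2 + d_L^2) + \big(\tfrac13 f + \bar f\big)(u_R^2 + d_R^2)\Big], \\
\frac{d\sigma_{NC}(\bar\nu N)}{dx} &= Kx\Big[\big(\tfrac13 f + \bar f\big)(u_L^2 + d_L^2) + \big(f + \tfrac13\bar f\big)(u_R^2 + d_R^2)\Big], \\
\frac{d\sigma_{CC}(\nu N)}{dx} &= Kx\big(f + \tfrac13\bar f\big), \\
\frac{d\sigma_{CC}(\bar\nu N)}{dx} &= Kx\big(\tfrac13 f + \bar f\big).
\end{aligned} \tag{12.23}$$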
Most experiments measure ratios of cross sections, where the flux of the neutrinos
drops out. A popular ratio is

$$R_\nu = \frac{\sigma_{NC}(\nu N)}{\sigma_{CC}(\nu N)}
= (u_L^2 + d_L^2) + \frac{2-B}{2+B}\,(u_R^2 + d_R^2), \tag{12.24}$$

with

$$B = \frac{\int dx\, x\,[f(x) - \bar f(x)]}{\int dx\, x\,[f(x) + \bar f(x)]}$$

measuring the relative strength of the valence- and the sea-quark contributions. For
instance, $B = 1$ corresponds to vanishing sea contribution. For the experimental
value $B = 0.8$ the ratio becomes

$$R_\nu = \tfrac12 - \sin^2\theta_W + \tfrac{50}{63}\sin^4\theta_W, \tag{12.25}$$

and the corresponding ratio for antineutrinos is

$$R_{\bar\nu} = \tfrac12 - \sin^2\theta_W + \tfrac{50}{27}\sin^4\theta_W. \tag{12.26}$$

The experimental values for $R_\nu$ and $R_{\bar\nu}$ are

$$R_\nu = 0.29 \pm 0.01 \quad\text{and}\quad R_{\bar\nu} = 0.34 \pm 0.03.$$


In order to compare them with the prediction of Eq. (12.23) it is necessary to
include precise quark distribution functions. They include contributions from
sea-quark $(s, \bar s)$ and $(c, \bar c)$ pairs of the target. In addition, scaling violations, which have
been established and analyzed in charged-current reactions, must be included (Kim
et al., 1981). The analysis yields

$$u_L^2 + d_L^2 = 0.29 \pm 0.01 \quad\text{and}\quad u_R^2 + d_R^2 = 0.03 \pm 0.01,$$

which lead to the value $\sin^2\theta_W = 0.228 \pm 0.001$.
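
The chain from the mixing angle to the measured ratios can be traced numerically. The sketch below assumes the standard-model values $x = 1 - 2\sin^2\theta_W$, $y = -1$, $\gamma = -\tfrac23\sin^2\theta_W$, $\delta = 0$, the linear relations $u_{L,R} = \tfrac14(x + \gamma \mp y \mp \delta)$ and $d_{L,R} = \tfrac14(-x + \gamma \pm y \mp \delta)$ that follow from comparing Eq. (12.1) with Eq. (12.20), and the simple isoscalar-target ratios with the valence fraction $B$. It omits the strange and charm sea and scaling violations, so it is not the full analysis quoted above; all function names are illustrative.

```python
def chiral_couplings(sin2_theta_w):
    """Return (u_L, u_R, d_L, d_R) from x, y, gamma, delta."""
    x = 1.0 - 2.0 * sin2_theta_w
    y = -1.0
    gamma = -2.0 / 3.0 * sin2_theta_w
    delta = 0.0
    u_l = 0.25 * (x + gamma - y - delta)
    u_r = 0.25 * (x + gamma + y + delta)
    d_l = 0.25 * (-x + gamma + y - delta)
    d_r = 0.25 * (-x + gamma - y + delta)
    return u_l, u_r, d_l, d_r


def r_nu(sin2_theta_w, b):
    """NC/CC ratio for neutrinos on an isoscalar target."""
    u_l, u_r, d_l, d_r = chiral_couplings(sin2_theta_w)
    return (u_l**2 + d_l**2) + (2.0 - b) / (2.0 + b) * (u_r**2 + d_r**2)


def r_nubar(sin2_theta_w, b):
    """NC/CC ratio for antineutrinos on an isoscalar target."""
    u_l, u_r, d_l, d_r = chiral_couplings(sin2_theta_w)
    return (u_l**2 + d_l**2) + (2.0 + b) / (2.0 - b) * (u_r**2 + d_r**2)


if __name__ == "__main__":
    s2 = 0.228  # value of sin^2(theta_W) quoted in the text
    u_l, u_r, d_l, d_r = chiral_couplings(s2)
    print("u_L^2 + d_L^2 =", u_l**2 + d_l**2)
    print("u_R^2 + d_R^2 =", u_r**2 + d_r**2)
    print("R_nu    (B = 0.8) =", r_nu(s2, 0.8))
    print("R_nubar (B = 0.8) =", r_nubar(s2, 0.8))
```

For $B = 0.8$ the weights $(2-B)/(2+B)$ and $(2+B)/(2-B)$ equal $3/7$ and $7/3$, which reproduces the coefficients $50/63$ and $50/27$ of $\sin^4\theta_W$.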


12.4 Parity violation in electron scattering 


Effects of weak neutral currents in low-energy ($Q^2 \ll M_Z^2$) electron–hadron reactions
are submerged in the dominant electromagnetic interaction. For these reactions
we must search for a clear signature of weak origin, such as parity violation.
Experiments of this type have been carried out in deep inelastic electron–hadron
scattering and in atomic physics (see Problem 4). The couplings of the $Z$ boson to
electrons and quarks have been discussed already.

A parity-violating observable is the difference of cross sections for right- and
left-handed polarized electrons. These are electrons polarized along their direction
of motion, i.e. electrons with definite helicity. Since helicity changes sign
under spatial reflection, a difference between the two cross sections is an indication
of parity violation. We denote the left-handed and right-handed electrons by the
spinors

$$e_{L,R} = \tfrac12\,(1 \mp \gamma_5)\,u(k), \tag{12.27}$$


respectively. Their interactions at high energies with protons and neutrons are described
with sufficient accuracy by the parton model. Consequently, we can write
the hadronic neutral current as

$$J_\mu(x) = \bar u\gamma_\mu[u_L(1-\gamma_5) + u_R(1+\gamma_5)]u + \bar d\gamma_\mu[d_L(1-\gamma_5) + d_R(1+\gamma_5)]d \tag{12.28}$$


as it appears in Eq. (12.20). The interaction of the electrons with hadrons now
involves the exchange of a photon and a $Z$ boson. In cross sections there are contributions
from the electromagnetic amplitude and weak terms. The latter contribution
is responsible for the asymmetry. The amplitudes are given as

$$\mathcal{M}_\gamma = \frac{ie^2}{q^2}\,\bar e\gamma_\mu e\,(e_u\,\bar u\gamma^\mu u + \cdots), \tag{12.29}$$

and

$$\mathcal{M}_Z = \frac{ig^2}{\cos^2\theta_W\,(q^2 - M_Z^2)}\,(g_L\,\bar e_L\gamma_\mu e_L + g_R\,\bar e_R\gamma_\mu e_R)
\times [u_L\,\bar u\gamma^\mu(1-\gamma_5)u + u_R\,\bar u\gamma^\mu(1+\gamma_5)u + \cdots], \tag{12.30}$$

where $e_u$ is the charge of the up quark and the ellipses indicate contributions from
other quarks. For the $Z$-boson couplings, $g_L$ and $g_R$ are the helicity couplings to
electrons while $u_L$ and $u_R$ are the corresponding couplings to the up quark. For
the computation of the interference term we follow the presentation of Section 8.3,
where it was shown that, in the squared amplitude, the following conditions hold.


(i) The electron bilinears are left-handed or right-handed. The same is true for the quarks.
(ii) When left-handed leptonic couplings combine with left-handed quark couplings then
$d\sigma/dy$ is independent of $y$. The same holds for right-handed leptonic couplings with
right-handed hadronic combinations.
(iii) When left-handed leptonic couplings combine with right-handed hadronic couplings,
then the dependence is $(1 - y)^2$.


The form of the interference terms now follows: 


$$\frac{d\sigma_L}{dx\,dy} \propto [g_Lu_L + g_Lu_R(1-y)^2]\,u(x) + \cdots, \tag{12.31}$$

$$\frac{d\sigma_R}{dx\,dy} \propto [g_Ru_L(1-y)^2 + g_Ru_R]\,u(x) + \cdots, \tag{12.32}$$

with the subscripts L and R in the cross section denoting left- and right-handed
polarized electrons and the ellipses indicating contributions from down quarks.
The parity-violating observable is built into the asymmetry

$$A = \frac{d\sigma_R - d\sigma_L}{d\sigma_R + d\sigma_L}, \tag{12.33}$$

which is easy to derive from (12.31) and (12.32). To arrive at the final result, we must
include the down quarks. In an isoscalar target, such as deuterium or carbon, there
are equal numbers of up and down quarks, so only the combination $u(x) + d(x)$
appears in the cross sections, which drops out in the asymmetry. On collecting the
various terms together, the asymmetry is expected to be

$$A = -\frac{GQ^2}{2\sqrt2\,\pi\alpha}\,\frac{9}{10}\left[a_1 + a_2\,\frac{1-(1-y)^2}{1+(1-y)^2}\right], \tag{12.34}$$

with $a_1 = 1 - (20/9)\sin^2\theta_W$ and $a_2 = 1 - 4\sin^2\theta_W$. I have given several steps of
the derivation so that the interested reader can reproduce it using the various coupling
constants given in the book. The magnitude of the asymmetry for $Q^2 = 1\ \mathrm{GeV}^2$
is approximately $-1.6 \times 10^{-4}$.


The effect was observed at the Stanford Linear Accelerator Center (SLAC) (Prescott
et al., 1978, 1979). Electron–proton and positron–proton collisions have been extended
at HERA to very large values of $Q^2 = 400$–$40000\ \mathrm{GeV}^2$, at which the
effects of the $Z$ propagator are also observable.
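
To get a feeling for the size of the effect, the sketch below evaluates the asymmetry numerically. It assumes that Eq. (12.34) has the form $A = -\frac{GQ^2}{2\sqrt2\pi\alpha}\,\frac{9}{10}\,[a_1 + a_2\,\frac{1-(1-y)^2}{1+(1-y)^2}]$ with $a_1$ and $a_2$ as quoted in the text. The constants `G_F` and `ALPHA` are standard low-energy values inserted here for illustration, and the function name is hypothetical.

```python
import math

G_F = 1.16637e-5       # Fermi constant in GeV^-2 (illustrative input)
ALPHA = 1.0 / 137.036  # fine-structure constant at low energy


def pv_asymmetry(q2, y, sin2_theta_w):
    """Parity-violating asymmetry for an isoscalar target.

    q2 : momentum transfer squared Q^2 in GeV^2
    y  : fractional energy loss of the electron, 0 <= y <= 1
    """
    a1 = 1.0 - 20.0 / 9.0 * sin2_theta_w
    a2 = 1.0 - 4.0 * sin2_theta_w
    kin = (1.0 - (1.0 - y) ** 2) / (1.0 + (1.0 - y) ** 2)
    prefactor = -G_F * q2 / (2.0 * math.sqrt(2.0) * math.pi * ALPHA) * 0.9
    return prefactor * (a1 + a2 * kin)


if __name__ == "__main__":
    # Overall scale at Q^2 = 1 GeV^2 (bracket set to one via sin2 = 0, y = 0).
    print("scale:", pv_asymmetry(1.0, 0.0, 0.0))
    # Dependence on y for a mixing angle near the measured value.
    for y in (0.1, 0.2, 0.3):
        print(y, pv_asymmetry(1.0, y, 0.23))
```

The term in $a_2$ is small near $\sin^2\theta_W = 1/4$, so the $y$-dependence of the asymmetry is a sensitive probe of the mixing angle.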


Problems for Chapter 12 


1. Make judicious use of the Schwarz inequality to prove Eqs. (12.11) and (12.14). 
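2. Select the Feynman rules for the electron–hadron reaction and show that the effective
interaction has the form

$$H_{\rm eff} = \frac{G}{\sqrt2}\,\bar e\gamma_\mu(g_V - g_A\gamma_5)e\,
[\bar u\gamma^\mu(v_u + a_u\gamma_5)u + \bar d\gamma^\mu(v_d + a_d\gamma_5)d + \cdots],$$

where

$$\begin{aligned}
g_V &= -\tfrac12 + 2\sin^2\theta_W, & g_A &= -\tfrac12; \\
v_u &= \big(1 - \tfrac83\sin^2\theta_W\big), & a_u &= -1; \\
v_d &= -\big(1 - \tfrac43\sin^2\theta_W\big), & a_d &= 1;
\end{aligned} \tag{12.35}$$

and the ellipses stand for strange and heavier quarks.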


3. Combine the results of the previous problem with the outline of Section 12.4 and obtain 
the final form of the asymmetry. 


Figure 12.1. A schematic drawing of a Z exchange in an atom. 


Figure 12.2. Feynman diagrams for electron—nucleus interaction. 


4. Another manifestation of neutral-current interactions appears as parity violation in 
atoms. The neutral current introduces a new interaction between the orbiting electron 
and the nucleus. The total force in the atom is the sum of electromagnetic and weak 
diagrams (Fig. 12.1) or can be expressed in terms of Feynman diagrams (Fig. 12.2). 

The sum of the amplitudes contributes 


2 
/ e em 
m = ë(k Woa a NN, IN) 


2 
& s(L! 1 NC 
——— glk)y” k)—— (N|J; IN), 
+ cosy elk Jy" (ev + gays)el ae [Ja IN) 
where q, = ky, — k, = P, — P,- In order to identify weak effects, a signal with parity 
violation is required. 
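Two parity-violating amplitudes are

$$M_1 = \frac{G}{\sqrt2}\,\bar e(k')\,g_A\gamma^\mu\gamma_5\,e(k)\,
\big\{\langle N|V_\mu^3|N\rangle - 2\sin^2\theta_W\,\langle N|J_\mu^{em}|N\rangle\big\}$$

and

$$M_2 = \frac{G}{\sqrt2}\,\bar e(k')\,g_V\gamma^\mu\,e(k)\,\langle N|A_\mu|N\rangle.$$

For $\sin^2\theta_W = 0.25$ (which is close to the experimental value), only the $M_1$ amplitude
survives. For this reason and because of the suppression of the hadronic matrix element
in $M_2$, we discuss below only $M_1$.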
The momenta involved in atomic experiments are small, so it is convenient to obtain
a non-relativistic limit of the weak interaction.
(i) For $|\mathbf{k}| \ll m_e$ and $|\mathbf{k}'| \ll m_e$, show that
1 Jo 1 
q? — M} 4x r 


The weak interaction is of short range and, in the limit of large Mz, it acts at the 
origin, where the nucleus is located. Later on we replace the Yukawa potential by 
a three-dimensional ô-function. 
(ii) For reduction of the hadronic matrix element to the non-relativistic limit, consider 
p= py. 


(N, DJG" IN, p) = Hp) Fla?) + ops aT 


rau. 


Show that, in the non-relativistic limit, only the $\mu = 0$ component survives and gives
the charge $Q$ of the nucleus. Similar arguments for the $V_\mu^3$ matrix element give

$$\langle N, p'|V_\mu^3|N, p\rangle = \delta_{\mu 0}\,\tfrac12\,(Z - N),$$

with $Z$ the number of protons and $N$ on the right-hand side the number of neutrons
in the nucleus. On combining the results from steps (i) and (ii), we find that the
nucleus generates, through the $M_1$ amplitude, the potential


G 5 
Zr) = sugi Ow, N), 


with Qw(Z, N) = 4[Z( — 4sin?@w) + N]. 


(iii) The transition-matrix element of the electron is 


5 3 
(e¢| Hpylei) = / ee )YpysZpei(r ar. 
For atomic physics the electron wave function can be written as 


1 
e(r)=| -9 |y), 
2m, 


with w;(r) the space wave function of level i, mẹ the mass of the electron and v 
the momentum operator. Show that the matrix element reduces to 

G 1 
/2 2me 
The effect of the neutral current on an atomic level is to induce the mixing of levels 
with opposite parities. Thus the absorption rates of beams of monochromatic light 
with various polarizations by atoms differ. One effect of parity violation in heavy 
atoms is the rotation of the plane of polarization of laser light passing through 
atomic vapors. Such experiments have been performed and effects of the neutral 
current have been observed (Bouchiat and Bouchiat, 1974). 


(e¢| Hpy lei) = 


Qw J Pr ROE POPO: yo. 
Physics of neutrinos


13.1 Neutrino masses 


Neutrinos as elementary particles have remarkable properties. They have only weak 
and gravitational interactions, which allows them to travel through matter making 
very few interactions. They carry a global quantum number, known as lepton number,
which can be broken without disturbing the conservation of electric charge.
The breaking of lepton number resides in the mass matrices which we introduce
in this section.

Their unique properties have led to several discoveries, the newest among them 
being neutrino oscillations, which provide information on their mass differences 
and mixing parameters. Neutrino oscillations also create new questions concerning 
their properties, which are now under active investigation. 

Masses for quarks and leptons were introduced in Chapters 8 and 9 through 
Yukawa couplings to the Higgs doublet. The neutrino remained massless because 
interactions of right-handed neutrinos had not been observed. This is a unique and 
unfamiliar situation, because all other fermions have right-handed components. In 
this chapter we shall describe the properties of neutrinos and introduce right-handed
neutrinos $N_R$, which are singlets under SU(2)$_L$. The representation content for the
electron family is

$$\Psi_L = \begin{pmatrix}\nu_e \\ e^-\end{pmatrix}_L, \quad N_R \quad\text{and}\quad e_R, \tag{13.1}$$

with $e_R = \tfrac12(1 + \gamma_5)e$. We should have written the right-handed state as $\nu_R$; however,
right-handed neutrinos will later play a special role (Majorana), so we decided
to denote them $N_R$. There is an analogous classification of the leptonic states for
the muon and tau families.

A mass term of the form 


$$m_D\,\bar\nu_L N_R + \text{h.c.} \tag{13.2}$$

is possible and is generated from the Yukawa coupling

$$\mathcal{L}_Y = f_\nu\,\bar\Psi_L\tilde\Phi N_R + \text{h.c.}, \tag{13.3}$$

where $\Phi$ is the standard Higgs doublet and
6 =ino* = Eo breaking G x n (13.4) 


We call this the Dirac mass term. The discussion so far is similar to that of quarks 
and charged leptons. However, the observed extreme smallness of neutrino masses 
seems to require a special treatment. It is possible to introduce another term, known 
as the Majorana mass term, which brings special properties and is a candidate for 
explaining the small masses. 

Before we proceed with new properties of the neutrinos, it is instructive to point 
out which spinors correspond to the above states. The identification is helpful when 
we proceed with calculations. The Dirac equation has four solutions: two solutions 
with E > 0 and two with E < 0, which describe particle and antiparticle states to 
be denoted by u and v, respectively. We follow standard textbook notation with 


$$\psi_{1,2} = u_i\,e^{-ip\cdot x} \quad\text{and}\quad \psi_{3,4} = v_i\,e^{ip\cdot x} \quad\text{for } i = 1, 2, \tag{13.5}$$


where, for particles moving in the z-direction, the four spinors are 


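$$\begin{aligned}
u_1 &= \sqrt{\frac{E+m}{2m}}\begin{pmatrix}1\\ 0\\ \dfrac{p_z}{E+m}\\ 0\end{pmatrix}, &
u_2 &= \sqrt{\frac{E+m}{2m}}\begin{pmatrix}0\\ 1\\ 0\\ \dfrac{-p_z}{E+m}\end{pmatrix}, \\
v_1 &= \sqrt{\frac{E+m}{2m}}\begin{pmatrix}\dfrac{p_z}{E+m}\\ 0\\ 1\\ 0\end{pmatrix}, &
v_2 &= \sqrt{\frac{E+m}{2m}}\begin{pmatrix}0\\ \dfrac{-p_z}{E+m}\\ 0\\ 1\end{pmatrix}.
\end{aligned} \tag{13.6}$$

The normalization of the spinors is as follows: $\bar u_i u_j = \delta_{ij}$ and $\bar v_i v_j = -\delta_{ij}$.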


For massless neutrinos the spinors are eigenfunctions of the operator ys and we 
define the spinor for the neutrino as 


i 
v= (5) (13.7) 


and the spinor for the antineutrino as 


1 
p= ( teu, (13.8) 
In this limit we speak of left-handed neutrinos and right-handed antineutrinos. We
also notice that in the limit $p_z \gg m$ there are only two independent spinors.

For massive particles the situation is different. Considering the Hamiltonian of
a free Dirac particle,

$$H = c\,\boldsymbol{\alpha}\cdot\mathbf{p} + \beta mc^2, \tag{13.9}$$

where $\beta\alpha_i = \gamma_i$ and $\beta = \gamma_0$, we notice that $\gamma_5$ does not commute with the
Hamiltonian since $[\gamma_5, \gamma_0] \neq 0$. Thus, for massive neutrinos, $\gamma_5$ is not a good quantum
number; i.e. the spinors are not eigenstates of the $\gamma_5$ operator. For the massive case
we introduce another operator, the helicity


$$h = \frac{\boldsymbol{\Sigma}\cdot\mathbf{p}}{2|\mathbf{p}|}
= \frac12\begin{pmatrix}\boldsymbol{\sigma}\cdot\hat{\mathbf{p}} & 0\\ 0 & \boldsymbol{\sigma}\cdot\hat{\mathbf{p}}\end{pmatrix}. \tag{13.10}$$

It is easy to verify that the spinors $u_1$ and $v_1$ are eigenfunctions with helicity $+\tfrac12$.
Similarly, $u_2$ and $v_2$ are eigenfunctions of the helicity operator with eigenvalue
$-\tfrac12$. In a given Lorentz frame, helicity in a reaction is conserved. However, the
helicity of a massive particle depends on the frame, because, by moving very fast,
we can reverse the momentum of a particle, leaving its spin unchanged.

A new mass term of the form 


$$\tfrac12\,M_M\,\overline{N_R^c}\,N_R + \text{h.c.} \tag{13.11}$$

is allowed to be present, since it is Lorentz- and SU(2)$_L$-invariant. This is known
as a Majorana mass term and is unique to neutrinos, which are neutral particles. A
Majorana mass term conserves electric charge but changes the lepton number by
two units. It may be introduced to the Lagrangian as an additional term or as a
new interaction term, coupled to a new scalar particle which is an SU(2) singlet.
A Majorana mass is generated by assigning to the new scalar particle a vacuum 
expectation value. Consequently, we generate Dirac mass terms through Eq. (13.3) 
and a Majorana term through Eq. (13.11). On collecting these terms together, one 
obtains the neutrino mass matrix: 


= ae 0 Mp VL 
(PL NR) E A ( a (13.12) 


Once we have introduced mass terms that do not preserve the original symmetry, 
we must solve the problem again using the Lagrangian. This means that, after 
introducing Dirac and/or Majorana mass terms, we should solve the problem using 
the rules of the new Lagrangian (see Problem 13.3). For our specific case, we can no 
longer use the eigenfunctions of Eq. (13.6). We must diagonalize the mass matrix 
and use the new mass eigenstates (physical states) which will have components 
that couple the various flavors with each other and, in addition, couple particles to
antiparticles (when Majorana mass terms are present).

For $M_M \gg m_D$ the mass matrix of the form given in Eq. (13.12) has eigenvalues
with specific properties: one eigenvalue is large and the other is suppressed. This
method of introducing masses for neutrinos is known as the see-saw mechanism, 
to which we shall return in the fourth section. In the meantime we shall discuss 
neutrino oscillations in free space and in matter, which are active fields of research 
nowadays. 
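
The suppression of one eigenvalue can be made concrete with a small numerical sketch. It assumes that the mass matrix of Eq. (13.12) has the standard see-saw form with entries $0$, $m_D$, $m_D$, $M_M$, whose eigenvalues are $\tfrac12\big(M_M \pm \sqrt{M_M^2 + 4m_D^2}\big)$. The input masses below are hypothetical round numbers chosen only for illustration, and the function name is not from the text.

```python
import math


def seesaw_masses(m_d, m_m):
    """Return (light, heavy) mass magnitudes for [[0, m_d], [m_d, m_m]].

    The light eigenvalue is negative; its magnitude is computed from the
    product of eigenvalues (-m_d^2) to avoid cancellation when m_m >> m_d.
    """
    root = math.sqrt(m_m**2 + 4.0 * m_d**2)
    heavy = 0.5 * (m_m + root)
    light = m_d**2 / heavy
    return light, heavy


if __name__ == "__main__":
    m_d = 100.0    # hypothetical Dirac mass in GeV
    m_m = 1.0e14   # hypothetical Majorana mass in GeV
    light, heavy = seesaw_masses(m_d, m_m)
    print(f"light = {light * 1.0e9:.3g} eV, heavy = {heavy:.3g} GeV")
```

With these placeholder inputs the light mass comes out near $m_D^2/M_M$, i.e. a fraction of an eV, while the heavy state stays at $M_M$.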


13.2 Neutrino oscillations 


The couplings of Dirac neutrinos to charged and neutral currents conserve the 
lepton number, as has been tested in many experiments. Lepton flavor can be 
violated in the mass matrix, with the appearance of off-diagonal elements. In this 
case the eigenfunctions of the Hamiltonian are superpositions of neutrinos with 
various flavor numbers. We shall call them the mass eigenstates and they have the 
time development given below in Eq. (13.14). In a physical reaction, however, the 
neutrinos that are produced have definite flavor number. Their time development 
requires special attention because we must rewrite the flavor states in terms of mass 
eigenstates, whose time development is that given in Eq. (13.14). This mis-match 
between the production of flavor states and the time development of mass states 
leads to an oscillation of lepton quantum numbers that will be described below. 
We demonstrate the mixing phenomenon for two generations of neutrinos, for 
which the algebra is simpler. We consider $\nu_e$ and $\nu_\mu$ neutrinos with the mass matrix

$$i\frac{\partial}{\partial t}\begin{pmatrix}\nu_e\\ \nu_\mu\end{pmatrix} = H\begin{pmatrix}\nu_e\\ \nu_\mu\end{pmatrix}
\quad\text{with mass matrix}\quad
M = \begin{pmatrix} m_{ee} & m_{e\mu}\\ m_{\mu e} & m_{\mu\mu}\end{pmatrix}. \tag{13.13}$$

The mass matrix mixes electron and muon neutrinos in a manner analogous to
atomic physics, where energy levels are mixed through the interactions with external
fields, such as magnetic fields. For the sake of simplicity, we assume that the
matrix elements are real and in addition that $m_{\mu e} = m_{e\mu}$. A symmetric matrix is
diagonalized by an orthogonal matrix to be denoted by $O$ and has the eigenvalues
$m_1$ and $m_2$. A mass eigenstate with momentum $p$ has the time development

$$|\nu_i(t)\rangle = |\nu_i(0)\rangle\,e^{-iE_it}, \tag{13.14}$$


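with $i = 1$ or $2$ and $E_i = \sqrt{p^2 + m_i^2}$. To avoid confusion, we shall use two types of
subscripts, with numbers denoting physical eigenstates and letters denoting flavor
states. Let us parametrize the orthogonal matrix as follows:

$$O = \begin{pmatrix}\cos\theta & \sin\theta\\ -\sin\theta & \cos\theta\end{pmatrix};$$

then the mass eigenstates are given by

$$\begin{pmatrix}\nu_1(t)\\ \nu_2(t)\end{pmatrix} = \begin{pmatrix}\cos\theta & -\sin\theta\\ \sin\theta & \cos\theta\end{pmatrix}\begin{pmatrix}\nu_e(t)\\ \nu_\mu(t)\end{pmatrix}. \tag{13.15}$$

We can easily invert this equation to obtain

$$\begin{pmatrix}\nu_e(t)\\ \nu_\mu(t)\end{pmatrix} = \begin{pmatrix}\cos\theta & \sin\theta\\ -\sin\theta & \cos\theta\end{pmatrix}\begin{pmatrix}\nu_1(t)\\ \nu_2(t)\end{pmatrix}, \tag{13.16}$$

which, with the help of Eq. (13.14), can be rewritten as

$$\begin{aligned}
|\nu_e(t)\rangle &= e^{-iE_1t}\,|\nu_1(0)\rangle\cos\theta + e^{-iE_2t}\,|\nu_2(0)\rangle\sin\theta, \\
|\nu_\mu(t)\rangle &= -e^{-iE_1t}\,|\nu_1(0)\rangle\sin\theta + e^{-iE_2t}\,|\nu_2(0)\rangle\cos\theta.
\end{aligned} \tag{13.17}$$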


The general structure of these equations is summarized by

$$|\nu_\alpha(t)\rangle = \sum_{k=1,2} U_{\alpha k}\,e^{-iE_kt}\,|\nu_k(0)\rangle,$$

with the unitary matrix $U_{\alpha k}$ defining the mixing between flavor and mass states.
The mixing matrix was introduced a long time ago and is referred to as the MNS 
matrix (Maki et al., 1962). 

On replacing $\nu_1(0)$ and $\nu_2(0)$ by the flavor eigenstates given in Eq. (13.15), we
obtain the final result

$$\begin{aligned}
|\nu_e(t)\rangle &= (\cos^2\theta\,e^{-iE_1t} + \sin^2\theta\,e^{-iE_2t})\,|\nu_e(0)\rangle
+ \sin\theta\cos\theta\,(e^{-iE_2t} - e^{-iE_1t})\,|\nu_\mu(0)\rangle, \\
|\nu_\mu(t)\rangle &= \sin\theta\cos\theta\,(e^{-iE_2t} - e^{-iE_1t})\,|\nu_e(0)\rangle
+ (\cos^2\theta\,e^{-iE_2t} + \sin^2\theta\,e^{-iE_1t})\,|\nu_\mu(0)\rangle.
\end{aligned} \tag{13.18}$$

This shows explicitly that the flavor content of the wave function changes with
time. For example, at $t = 0$ the first equation contains only an electron neutrino.
In the course of time a $\nu_\mu$ component develops. The second equation describes a
state which starts as $\nu_\mu(0)$.

We consider a state that starts as $\nu_e(0)$ and compute the probability of finding a
$\nu_e(t)$. The masses of the neutrinos are small relative to their momenta and one can
use the approximation $E_i = \sqrt{p^2 + m_i^2} \approx E[1 + m_i^2/(2E^2)]$ to arrive at

$$P_{ee}(t) = |\langle\nu_e|\nu_e(t)\rangle|^2 = 1 - \sin^2(2\theta)\,\sin^2\left(\frac{\Delta m^2}{4E}\,t\right), \tag{13.19}$$

with $\Delta m^2 = m_2^2 - m_1^2$. Similarly, we compute the probability of finding a $\nu_\mu$ at
time $t$,

$$P_{e\mu}(t) = |\langle\nu_\mu|\nu_e(t)\rangle|^2 = \sin^2(2\theta)\,\sin^2\left(\frac{\Delta m^2}{4E}\,t\right). \tag{13.20}$$


The sum of the two probabilities is equal to unity. A mono-energetic neutrino
beam thus oscillates with amplitude $\sin^2(2\theta)$ and wave-number $\Delta m^2/(4E)$. For
oscillations to occur, we need a non-zero $\theta$ and at least one non-zero mass. The
amplitude is maximal for $\theta = \pi/4$.
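
The two-flavor result can be verified numerically by evolving the amplitudes directly. The sketch below assumes the evolution of Eq. (13.18), with each mass eigenstate carrying a phase $e^{-iE_it}$, and compares the squared amplitudes with the closed forms of Eqs. (13.19) and (13.20), using $(E_2 - E_1)t \approx \Delta m^2 t/(2E)$. Function and argument names are illustrative, and the phases are given as plain dimensionless numbers.

```python
import cmath
import math


def amplitudes_from_nu_e(theta, phase1, phase2):
    """Amplitudes <nu_e|nu_e(t)> and <nu_mu|nu_e(t)>.

    phase1, phase2 : the dimensionless products E_1*t and E_2*t
    """
    c, s = math.cos(theta), math.sin(theta)
    e1 = cmath.exp(-1j * phase1)
    e2 = cmath.exp(-1j * phase2)
    a_ee = c * c * e1 + s * s * e2
    a_emu = s * c * (e2 - e1)
    return a_ee, a_emu


def survival_probability(theta, osc_phase):
    """Closed form with osc_phase = dm^2 * t / (4E)."""
    return 1.0 - math.sin(2.0 * theta) ** 2 * math.sin(osc_phase) ** 2


if __name__ == "__main__":
    theta, phase1, phase2 = 0.6, 0.3, 1.7   # arbitrary test values
    a_ee, a_emu = amplitudes_from_nu_e(theta, phase1, phase2)
    p_ee, p_emu = abs(a_ee) ** 2, abs(a_emu) ** 2
    print("P_ee from amplitudes:", p_ee)
    print("P_ee closed form    :", survival_probability(theta, 0.5 * (phase2 - phase1)))
    print("P_ee + P_emu        :", p_ee + p_emu)
```

Only the phase difference enters the probabilities, which is the numerical counterpart of the statement that a common phase of the mass eigenstates is unobservable.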

In vacuum-oscillation experiments there is a redundancy in the values of the
mixing angle. The same numerical value of $\sin^2(2\theta)$ appears eight times when $\theta$
varies between zero and $2\pi$. This means that in oscillation experiments it suffices to
consider angles in the interval between zero and $\pi/4$. In other situations for which
the mixing depends on $\cos(2\theta)$ the range for the mixing angle must be extended
from $0$ to $\pi/2$, as we shall discuss at the end of the next section.

We can express the various quantities in useful units and substitute $t \approx L$, where
the speed of light $c$ is set equal to unity,

$$\frac{\Delta m^2 L}{4E} = 1.27\,\frac{\Delta m^2}{\mathrm{eV}^2}\,\frac{L}{\mathrm{km}}\,\frac{\mathrm{GeV}}{E}. \tag{13.21}$$


This entity is a dimensionless quantity and we are free to choose the units. The units
in Eq. (13.21) are now standard and convenient for terrestrial experiments. This
formula defines the sensitivity of an experiment, since oscillations occur when
$\Delta m^2 L/E$ is of order unity. As a function of the baseline length, the first maximum of
the oscillation occurs where the phase $\Delta m^2 L/(4E)$ reaches $\pi/2$, i.e. at $L \approx 2\pi E/\Delta m^2$.
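As a rough illustration of Eq. (13.21), the sketch below recovers the numerical factor 1.27 from $\hbar c$ and evaluates the two-flavor transition probability in practical units. The helper names are hypothetical, and the baseline of the first maximum is taken to be the point where the phase reaches $\pi/2$.

```python
import math

HBARC_EV_M = 1.973269804e-7  # hbar*c in eV*m


def phase_coefficient():
    """Factor converting Delta m^2 [eV^2] * L [km] / E [GeV] into Delta m^2 L / (4E)."""
    metres_per_km = 1.0e3
    ev_per_gev = 1.0e9
    return metres_per_km / (4.0 * ev_per_gev * HBARC_EV_M)


def oscillation_phase(delta_m2_ev2, baseline_km, energy_gev):
    """Dimensionless phase of Eq. (13.21), using the rounded factor 1.27."""
    return 1.27 * delta_m2_ev2 * baseline_km / energy_gev


def transition_probability(sin2_2theta, delta_m2_ev2, baseline_km, energy_gev):
    """Two-flavor transition probability, Eq. (13.20), in practical units."""
    return sin2_2theta * math.sin(oscillation_phase(delta_m2_ev2, baseline_km, energy_gev)) ** 2


def first_maximum_km(delta_m2_ev2, energy_gev):
    """Baseline at which the phase first reaches pi/2."""
    return (math.pi / 2.0) * energy_gev / (1.27 * delta_m2_ev2)


# Example: a mass-squared difference of 2e-3 eV^2 and a 1 GeV beam.
baseline = first_maximum_km(2.0e-3, 1.0)
print(phase_coefficient())
print(baseline, transition_probability(1.0, 2.0e-3, baseline, 1.0))
```

For a mass-squared difference of a few times $10^{-3}$ eV$^2$ and GeV energies, the first maximum lies at several hundred kilometres, which is the regime of long-baseline and atmospheric measurements.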

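The generalization to three or more families is straightforward and leads to the
transition probability


$$|\langle\nu_\beta|\nu_\alpha(t)\rangle|^2 = \delta_{\alpha\beta} - 2\,\mathrm{Re}\sum_{j>i} U_{\alpha i}U^*_{\alpha j}U^*_{\beta i}U_{\beta j}\left[1 - \exp\!\left(i\,\frac{\Delta m^2_{ji}L}{2E}\right)\right], \tag{13.22}$$


with $\Delta m^2_{ji} = m_j^2 - m_i^2$. For two neutrino families, $U_{\alpha i}$ is a simple $2\times 2$ orthogonal matrix. Upon
substitution the probabilities simplify to Eqs. (13.19) and (13.20). For three families a
hierarchy of the form $\Delta m^2_{21} \ll \Delta m^2_{31} \simeq \Delta m^2_{32}$ and the smallness of one mixing-matrix
element ($U_{e3}$) describes all existing experiments.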
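The general probability can be sketched in a few lines by summing the amplitude over mass eigenstates instead of using the expanded form. This is an illustrative sketch under stated assumptions: the function name is hypothetical, the state is written as $|\nu_\alpha\rangle = \sum_k U_{\alpha k}|\nu_k\rangle$, a phase common to all mass states is dropped, and the practical units of Eq. (13.21) are used.

```python
import cmath
import math


def flavor_transition_probability(u, alpha, beta, masses_sq_ev2, baseline_km, energy_gev):
    """|<nu_beta|nu_alpha(L)>|^2 for an N x N mixing matrix u (list of rows).

    Each mass state k contributes U_{alpha k} exp(-i m_k^2 L / (2E)) U*_{beta k};
    m_k^2 L / (2E) equals 2 * 1.27 * m_k^2 [eV^2] * L [km] / E [GeV].
    """
    amplitude = 0j
    for k, m_sq in enumerate(masses_sq_ev2):
        phase = cmath.exp(-2j * 1.27 * m_sq * baseline_km / energy_gev)
        amplitude += u[alpha][k] * phase * complex(u[beta][k]).conjugate()
    return abs(amplitude) ** 2


# Two-flavor check with made-up inputs: the result should reduce to Eq. (13.19).
theta = 0.5
c, s = math.cos(theta), math.sin(theta)
u2 = [[c, s], [-s, c]]
masses_sq = [0.0, 2.0e-3]
p_ee = flavor_transition_probability(u2, 0, 0, masses_sq, 500.0, 1.0)
p_emu = flavor_transition_probability(u2, 0, 1, masses_sq, 500.0, 1.0)
closed_form = 1.0 - math.sin(2 * theta) ** 2 * math.sin(1.27 * 2.0e-3 * 500.0 / 1.0) ** 2
print(p_ee, closed_form, p_ee + p_emu)
```

Working with the amplitude avoids bookkeeping of the index pairs $j > i$ and handles complex mixing matrices without change.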
Finally, we present a simple description of what happens in oscillations. A $|\nu_\mu\rangle$
is composed of various waves with different $m_i$. At a given energy the heavier mass
states oscillate faster and the various $|\nu_i\rangle$ components come out of phase, so at a
certain distance they do not sum up to a $|\nu_\mu\rangle$. Since the oscillatory term comes from
an interference between the different $m_i$, a common phase factor of the $|\nu_i\rangle$s plays
no role and can be ignored. This will be useful in the next section, when we turn to
oscillation in matter.


13.2.1 Oscillation in matter 


The neutral-current interaction of neutrinos with matter is extremely weak, but
can nonetheless affect oscillations (Wolfenstein, 1978). The reason is that the
momentum transfer to the target can have a large wavelength, such that the neutrino
interacts coherently with all the particles within its wavelength. At the same time
the difference $E_2 - E_1 \approx \Delta m^2_{21}/(2E)$ may correspond to a wavelength of the same
magnitude, so the time development of the mass eigenstates is influenced by the
interactions with the medium. In the medium the flavor eigenstates $\nu_e$ and $\nu_\mu$ interact
with electrons and protons with different cross sections and modify the development
of the mass eigenstates.
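The Hamiltonian on the mass basis is given by


$$i\,\frac{\partial\psi_i}{\partial t} = E_i\,\psi_i \approx p\left(1 + \frac{m_i^2}{2p^2}\right)\psi_i \;\longrightarrow\; \frac{m_i^2}{2E}\,\psi_i, \tag{13.23}$$


where after the arrow we omitted the term proportional to the unit matrix. This
term does not influence the oscillations because it can be eliminated by the
transformation $\nu_i \to \nu_i e^{-ipt}$ common for all neutrinos. For this reason we can simplify
the Hamiltonian,

$$i\,\frac{d}{dt}\begin{pmatrix}\nu_1\\ \nu_2\end{pmatrix} = \frac{1}{2E}\begin{pmatrix} m_1^2 & 0\\ 0 & m_2^2\end{pmatrix}\begin{pmatrix}\nu_1\\ \nu_2\end{pmatrix}. \tag{13.24}$$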


The interactions with matter, however, involve flavor states. For this reason
we transform the equations to the flavor basis where we include the interactions
with matter. The solution of the new eigenvalue problem describes the propagating
eigenfunctions. In the flavor basis

$$H_{\rm flavor} = U\,\frac{1}{2E}\begin{pmatrix} m_1^2 & 0\\ 0 & m_2^2\end{pmatrix}U^\dagger = \frac{1}{4E}\begin{pmatrix}\Sigma - \Delta m^2\cos(2\theta) & \Delta m^2\sin(2\theta)\\ \Delta m^2\sin(2\theta) & \Sigma + \Delta m^2\cos(2\theta)\end{pmatrix}, \tag{13.25}$$


where $\Sigma = m_1^2 + m_2^2$, $\Delta m^2 = m_2^2 - m_1^2$, and $U$ is the mixing matrix in Eq. (13.16).
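The change of basis can be verified by direct matrix multiplication. The sketch below assumes that the mixing matrix of Eq. (13.16), which is not reproduced in this section, has the form $U = \begin{pmatrix}\cos\theta & \sin\theta\\ -\sin\theta & \cos\theta\end{pmatrix}$, as suggested by Eq. (13.17); the function names and inputs are illustrative.

```python
import math


def flavor_hamiltonian(theta, m1_sq, m2_sq, energy):
    """U diag(m1^2, m2^2)/(2E) U^T, with the assumed real rotation matrix U."""
    c, s = math.cos(theta), math.sin(theta)
    u = [[c, s], [-s, c]]  # assumed form of Eq. (13.16)
    diag = [m1_sq / (2.0 * energy), m2_sq / (2.0 * energy)]
    return [[sum(u[a][k] * diag[k] * u[b][k] for k in range(2)) for b in range(2)]
            for a in range(2)]


def flavor_hamiltonian_closed_form(theta, m1_sq, m2_sq, energy):
    """Right-hand side of Eq. (13.25)."""
    sigma = m1_sq + m2_sq
    delta = m2_sq - m1_sq
    c2, s2 = math.cos(2.0 * theta), math.sin(2.0 * theta)
    pref = 1.0 / (4.0 * energy)
    return [[pref * (sigma - delta * c2), pref * delta * s2],
            [pref * delta * s2, pref * (sigma + delta * c2)]]


# Made-up inputs in arbitrary units.
h_product = flavor_hamiltonian(0.4, 1.0, 3.0, 10.0)
h_closed = flavor_hamiltonian_closed_form(0.4, 1.0, 3.0, 10.0)
print(h_product)
print(h_closed)
```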
In a medium there is an additional interaction Hamiltonian created by the neutral
and charged currents. Recall the interaction term from Chapter 8,


$$H_{\rm int} = \frac{G_F}{\sqrt 2}\,\bar\psi_x\gamma_\mu\left(g_V^x - g_A^x\gamma_5\right)\psi_x\;\bar\psi_\nu\gamma^\mu(1-\gamma_5)\psi_\nu. \tag{13.26}$$


Here $x$ denotes weakly interacting particles in the medium, which in the Sun are
electrons, neutrons, and protons. For left-handed neutrinos $(1-\gamma_5)\psi_\nu = 2\psi_\nu$. The
term $\langle\bar\psi_x\gamma_\mu\gamma_5\psi_x\rangle$ reduces in the non-relativistic limit to the expectation value for
the spin operator. In an unpolarized medium, the states $\psi_x$ occupy all possible spin
states and this term averages to zero. For the remaining term $\langle\bar\psi_x\gamma_\mu\psi_x\rangle$ the space
components are proportional to the momentum of the particles, which is small.
Consequently only the term $\langle\bar\psi_x\gamma_0\psi_x\rangle = n_x$ survives and is equal to the density
of the particles $n_x$. This term produces the potential $g_V^x\sqrt2\,G_F\,n_x\,\psi_\nu^\dagger\psi_\nu$, which is
added to the Hamiltonian. Finally, we obtain the Schrödinger equation from the
Euler-Lagrange equation for $\psi_\nu$.

We already know the couplings of neutrinos to electrons from Section 8.3, where 
they are summarized in Table 8.1. The couplings to nucleons are obtained in a similar 
way: 


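$$\begin{aligned}
g_V^e &= \tfrac12 + 2\sin^2\theta_W && \text{for } \nu_e,\\
g_V^e &= -\tfrac12 + 2\sin^2\theta_W && \text{for } \nu_\mu,\\
g_V^p &= \tfrac12 - 2\sin^2\theta_W && \text{for } \nu_\alpha,\\
g_V^n &= -\tfrac12 && \text{for } \nu_\alpha,
\end{aligned} \tag{13.27}$$


with $\alpha$ being either the electron neutrino or the muon neutrino. The difference
of 1 in the $g_V^e$ contribution between $\nu_e$ and $\nu_\mu$ comes from the Fierz-transformed
charged-current term in the Hamiltonian. In an electrically neutral medium $n_e = n_p$,
therefore


$$V = \sqrt2\,G_F\begin{cases} -\tfrac12 n_n + n_e & \text{for } \nu_e,\\[2pt] -\tfrac12 n_n & \text{for } \nu_\mu.\end{cases} \tag{13.28}$$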
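To get a feeling for the size of the potentials in Eq. (13.28), the sketch below converts number densities from cm$^{-3}$ to natural units and evaluates them. The function names are hypothetical; the constants are the standard values of $G_F$ and $\hbar c$, and the density used in the example (one mole of electrons and of neutrons per cm$^3$) is an arbitrary reference choice, not a value from the text.

```python
import math

G_F_EV = 1.1663787e-23        # Fermi constant in eV^-2 (1.1663787e-5 GeV^-2)
HBARC_EV_CM = 1.973269804e-5  # hbar*c in eV*cm


def number_density_to_ev3(n_per_cm3):
    """Convert a number density from cm^-3 to eV^3 (natural units)."""
    return n_per_cm3 * HBARC_EV_CM ** 3


def matter_potentials_ev(n_e_per_cm3, n_n_per_cm3):
    """Potentials (V for nu_e, V for nu_mu) of Eq. (13.28), in eV."""
    scale = math.sqrt(2.0) * G_F_EV
    n_e = number_density_to_ev3(n_e_per_cm3)
    n_n = number_density_to_ev3(n_n_per_cm3)
    v_e = scale * (n_e - 0.5 * n_n)
    v_mu = scale * (-0.5 * n_n)
    return v_e, v_mu


# Reference density: Avogadro's number of electrons and of neutrons per cm^3.
AVOGADRO = 6.02214076e23
v_e, v_mu = matter_potentials_ev(AVOGADRO, AVOGADRO)
print(v_e, v_mu, v_e - v_mu)
```

Only the difference $V_e - V_\mu = \sqrt2\,G_F n_e$ affects oscillations between $\nu_e$ and $\nu_\mu$, since the neutron term is common to both flavors.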


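The new contribution from the scattering with matter must be added to the $ee$ and
$\mu\mu$ elements of $H_{\rm flavor}$, yielding a propagation equation


$$i\,\frac{d}{dt}\begin{pmatrix}\nu_e\\ \nu_\mu\end{pmatrix} = \frac{1}{2E}\begin{pmatrix} m^2_{ee} + 2\sqrt2\,E\,G_F n_e & m^2_{e\mu}\\ m^2_{e\mu} & m^2_{\mu\mu}\end{pmatrix}\begin{pmatrix}\nu_e\\ \nu_\mu\end{pmatrix}, \tag{13.29}$$


with matrix elements


$$m^2_{ee} = -\tfrac12\Delta m^2\cos(2\theta), \qquad m^2_{\mu\mu} = \tfrac12\Delta m^2\cos(2\theta), \qquad m^2_{e\mu} = \tfrac12\Delta m^2\sin(2\theta). \tag{13.30}$$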


One sees that effects of matter become more important at higher energy. Note
that we ignored terms proportional to the unit matrix. Apart from the aforementioned
fact that they can be transformed away, there is another justification: after
diagonalization the mixing angle and the difference between the eigenvalues are
independent of equal diagonal terms (see Problem 13.1). Exactly such terms appear
in the formula (13.30) for the oscillation probability and have been omitted.
Diagonalization of the matrix gives the mass difference


$$m^2_{2m} - m^2_{1m} = \Delta m^2_m = \Delta m^2\sqrt{[A - \cos(2\theta)]^2 + \sin^2(2\theta)}, \tag{13.31}$$

where

$$A = \frac{2\sqrt2\,E\,G_F n_e}{\Delta m^2},$$

and the mixing angle (Wolfenstein, 1978; Mikheyev and Smirnov, 1985)


$$\tan(2\theta_m) = \frac{\sin(2\theta)}{\cos(2\theta) - A}, \qquad \sin^2(2\theta_m) = \frac{\sin^2(2\theta)}{[\cos(2\theta) - A]^2 + \sin^2(2\theta)}. \tag{13.32}$$

In the limit $n_e \to 0$, we recover the formulas for oscillation in vacuum. The careful
reader will notice that there is a resonance effect when $A = \cos(2\theta)$ (Mikheyev and
Smirnov, 1985). For this value of $A$ the mixing in the medium is maximal, even
though the mixing in the vacuum can be very small.
The angle $\theta_m$ expresses the matter eigenstates in terms of the flavor states


$$\begin{pmatrix}\nu_{1m}\\ \nu_{2m}\end{pmatrix} = \begin{pmatrix}\cos\theta_m & -\sin\theta_m\\ \sin\theta_m & \cos\theta_m\end{pmatrix}\begin{pmatrix}\nu_e\\ \nu_\mu\end{pmatrix}. \tag{13.33}$$

We discuss two cases realized in the Sun. We know that electron-type neutrinos
are created in the interior of the Sun, where $n_e$ is very large. In that region


$$\tan(2\theta_m) \to -\frac{\sin(2\theta)}{A} \quad\text{and}\quad \theta_m \to \frac{\pi}{2}, \tag{13.34}$$


since $\tan(2\theta_m)$ approaches zero from negative values. Equation (13.33) now gives
$\nu_{1m} \approx -\nu_\mu$ and $\nu_{2m} \approx \nu_e$, i.e. in the interior of the Sun the state with the heavier
effective mass is the electron neutrino. As the beam traverses the Sun, $n_e$ decreases
and $\tan(2\theta_m) \to \tan(2\theta)$, therefore $\theta_m \to \theta$ and, for small vacuum mixing, $\nu_{2m} \approx \nu_\mu$. The flavor content of the
beam changes. This phenomenon is called the MSW effect. It will be interesting
to measure in terrestrial experiments the lepton-number content of the neutrinos 
arriving from the Sun. 
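The behaviour of the mixing angle in matter, including the resonance and the high-density limit, can be sketched numerically from Eqs. (13.31) and (13.32). The function names are illustrative assumptions, and the branch of the arctangent is chosen so that $\theta_m$ lies between 0 and $\pi/2$, in line with Eq. (13.34).

```python
import math


def matter_mixing(theta, a):
    """Return (sin^2(2 theta_m), Delta m^2_m / Delta m^2) for density parameter A = a."""
    s2, c2 = math.sin(2.0 * theta), math.cos(2.0 * theta)
    denom = (c2 - a) ** 2 + s2 ** 2
    return s2 ** 2 / denom, math.sqrt(denom)


def matter_angle(theta, a):
    """theta_m in [0, pi/2] from tan(2 theta_m) = sin(2 theta) / (cos(2 theta) - A)."""
    return 0.5 * math.atan2(math.sin(2.0 * theta), math.cos(2.0 * theta) - a)


theta_vacuum = 0.1  # a small vacuum mixing angle, chosen for illustration only

# Vacuum limit (A = 0), resonance (A = cos 2 theta), and very dense matter (A >> 1).
for a in (0.0, math.cos(2.0 * theta_vacuum), 1.0e6):
    print(a, matter_mixing(theta_vacuum, a), matter_angle(theta_vacuum, a))
```

At the resonance the mixing is maximal ($\theta_m = \pi/4$) although the vacuum angle is small, and for very large $A$ the angle approaches $\pi/2$, so that the heavier matter eigenstate is almost purely $\nu_e$.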

Next we discuss terrestrial experiments with a $\nu_\mu$ beam going through the Earth.
This situation can be realized in long-baseline accelerator experiments or via
atmospheric neutrinos. We assume that the $\nu_\mu$ oscillates into another flavor state, as
reported from the Superkamiokande experiment. Two cases are of interest.


Case 1 


The $\nu_\mu$ mixes with the $\nu_\tau$. The interaction with matter proceeds with the exchange
of a $Z^0$ boson and is identical for $\mu$ and $\tau$ neutrinos. It will thus add the same term to
the diagonal elements of $H_{\rm flavor}$, which does not affect the oscillation. This means
that no effects of matter appear.


Case 2 
The $\nu_\mu$ mixes with a sterile neutrino $\nu_s$. The interaction with matter in the transition
$\nu_\mu \to \nu_\mu$ proceeds through the exchange of a $Z^0$ boson, but there is no such process
for the $\nu_s$. We must add an interaction with matter only in the $\mu\mu$ element and the
propagation Hamiltonian has the form


$$H_{\rm flavor} = \frac{1}{2E}\begin{pmatrix} m^2_{\mu\mu} - \sqrt2\,E\,G_F n_n & m^2_{\mu s}\\ m^2_{\mu s} & m^2_{ss}\end{pmatrix}. \tag{13.35}$$


The notation is analogous to that in Eq. (13.30) with mass elements $m^2_{\mu\mu}$, $m^2_{\mu s}$,
and $m^2_{ss}$ similar to those in Eq. (13.30). The angle $\theta$ now governs the oscillation
between $\nu_\mu$ and $\nu_s$. We note that the sign of the matter term is reversed and the
neutron density $n_n$ replaces the density of electrons.

Finally, Eqs. (13.31) and (13.32) depend on $\cos(2\theta)$ and the range $0 \le \theta \le \pi/4$
does not cover negative values of $\cos(2\theta)$. The range must now be extended to


$$0 \le \theta \le \pi/2.$$


13.3 Experimental results 


There has been a long search for neutrino oscillations. The experiments were 
carried out with neutrino beams from nuclear reactors, the Sun, and accelerators, 
and recently with atmospheric neutrinos. In recent years several experiments have 
provided evidence for neutrino oscillation. We will not discuss the experiments in 
detail but only summarize the main results. For an overview of the current status, 
see Mohapatra et al. (2005). 
• Atmospheric neutrinos. Compelling evidence for oscillation comes from experiments
with atmospheric neutrinos. They are produced in the atmosphere by meson decays,
which in turn are produced by the interaction of cosmic rays with nuclei of the
atmosphere. Both electron and muon neutrinos are produced in pion and muon decays and the
ratio at production is roughly two muon neutrinos to one electron neutrino. Neutrino
interactions are detected in a huge underground target/detector, the Superkamiokande.
The experiment established that there is a decrease in the number of muon-type neutrinos
and no decrease in the number of electron neutrinos. This implies that the $\nu_\mu$ neutrinos
oscillate into another type, such as $\tau$ or sterile (see below) neutrinos. A zenith-angle
dependence of the disappearance is observed, whereby more of the neutrinos coming from
underneath the detector disappear. They are produced in the atmosphere and travel through
the Earth in order to reach the detector. These observations require a mass difference of
$\Delta m^2 \approx 2\times10^{-3}$ eV$^2$ and the mixing angle to be maximal.

• Solar neutrinos. Nuclear reactions in the Sun produce a tremendous amount of heat,
which is radiated, and at the same time produce electron neutrinos. The cycles which
produce them have been studied and the neutrino spectra have been calculated. For three
decades experiments measuring the flux of neutrinos from the Sun have observed deficits,
indicating that $\nu_e$ neutrinos, during their journey from Sun to Earth, oscillate into other
types of neutrinos. Previous experiments trying to verify this result with reactor
neutrinos failed to find a deficit, indicating that the distance of 1 km from the reactors is too
small. The explanation of the solar deficit requires either oscillation in vacuum
with a large mixing angle and $\Delta m^2 \approx 10^{-10}$ eV$^2$ or oscillations inside the Sun where
effects of matter according to the MSW effect are important. The latter possibility gives
two solutions with either large mixing (LAMSW) or small mixing (SAMSW) and with
$\Delta m^2 \approx 8\times10^{-5}$ eV$^2$.

• Reactor experiments. Reactor experiments search for a decrease of the antineutrino flux
far away from the reactor. The CHOOZ collaboration detected $\bar\nu_e$ at a distance of
1 km but saw no deficit. These data restrict the mixing angle $\theta_{13}$, as we will discuss
below. The experiment KamLAND measured the flux of $\bar\nu_e$ neutrinos from distant nuclear
reactors. The experiment uses a target containing 1000 tons of liquid scintillator viewed
by more than 1800 light-detecting photomultiplier tubes. It detects electron antineutrinos
emitted by about 70 nuclear reactors in Japan and South Korea arriving from an average
distance of 180 km. The ratio of the observed antineutrino interactions to the expected
number without disappearance of $\bar\nu_e$ is $0.611 \pm 0.085$ (statistical uncertainty) $\pm\ 0.041$
(systematic uncertainty) for $\bar\nu_e$ energies >3.4 MeV. In the two-flavor analysis only the
“large-mixing-angle” solution is allowed. The best fit of their data gives $\Delta m^2 = 6.9\times10^{-5}$ eV$^2$
and $\sin^2(2\theta) = 1.0$, which selects the large-mixing-angle solution.

• The Sudbury experiment. The Sudbury Neutrino Observatory (SNO) detects $^8$B solar
neutrinos through the reactions


$$\begin{aligned}
\nu_e + d &\to p + p + e^-,\\
\nu_x + d &\to p + n + \nu_x,\\
\nu_x + e^- &\to \nu_x + e^-,
\end{aligned}$$

where the subscript $x$ denotes any type of flavor. Only electron neutrinos produce
charged-current interactions, while the neutral-current (NC) interactions and elastic scattering are
sensitive to all types of flavors. The NC reaction measures the total flux of all active
neutrino flavors produced in the Sun with energy above the threshold of 2.2 MeV. This
provides a measurement of the solar neutrino flux as


$$\left[5.21 \pm 0.27\ \text{(statistical uncertainty)} \pm 0.38\ \text{(systematic uncertainty)}\right] \times 10^{6}\ \mathrm{cm^{-2}\,s^{-1}},$$


which is in agreement with the standard solar models. A global analysis of these and other
solar- and reactor-neutrino results yields $\Delta m^2 = 7.1^{+1.2}_{-0.6}\times10^{-5}$ eV$^2$ and $\theta = 32.5^{+2.4}_{-2.3}$
degrees.

• Accelerator neutrinos. The LSND experiment has found an electron-antineutrino excess
in a muon-antineutrino beam. The solution would be small mixing and a mass-squared
difference of 0.1-1 eV$^2$. However, the very similar KARMEN experiment has found no
such effect, but it cannot rule out LSND’s complete parameter space. The LSND results are
referred to as the LSND anomaly because the large mass difference cannot be reconciled
with the three-family model. The MiniBooNE experiment at Fermilab is currently running
with the aim of checking the LSND result.

• Long-baseline experiments. There are several experiments that will use two detectors,
one close to the accelerator, where the neutrino beam is produced, and a second detector
at a distance of 200-400 km. The nearby detector will be used for calibration, to
determine the beam and properties of the neutrino reactions. The faraway detector will
be looking at the changes that take place because muon neutrinos oscillate to neutrinos
of another flavor. A small-scale experiment, K2K, operated in Japan and two others are
under construction, MINOS in the USA and OPERA in Europe.


For three flavors of neutrinos there are only two independent $\Delta m^2$. If the result
from the LSND experiment is correct, then there must be a fourth flavor, which has
to be “sterile,” i.e. it does not couple to the gauge bosons and therefore does not
contribute to the $Z^0$ width. The results of the other experiments are summarized as
follows:


$$(\Delta m^2\ (\mathrm{eV}^2),\ \sin^2\theta) \approx \begin{cases} (2\times10^{-3},\ {\sim}0.5), & \text{atmospheric},\\ (7\times10^{-5},\ {\sim}0.3)\ \text{LAMSW}, & \text{solar, KamLAND}.\end{cases} \tag{13.36}$$


We can now estimate the magnitude of the leptonic MNS matrix. Let us ignore
the LSND result and assume that there are two mass-squared differences governing
the atmospheric and solar oscillation. They obey $\Delta m^2_S \ll \Delta m^2_A$. If we identify
$\Delta m^2_{21} = \Delta m^2_S \ll \Delta m^2_A = \Delta m^2_{31} \simeq \Delta m^2_{32}$, then we have, for a short-baseline
reactor experiment such as CHOOZ (see Problem 2),


$$P^{\rm CHOOZ}_{ee} = 1 - 4|U_{e3}|^2\left(1 - |U_{e3}|^2\right)\sin^2\Delta_{31}, \tag{13.37}$$
where $\Delta_{ij} = \Delta m^2_{ij}L/(4E)$ and $L$ is the distance for the CHOOZ experiment. The
absence of disappearance of neutrinos in the experiment means that $|U_{e3}|$ is either
small or close to unity. The probability for solar neutrinos is (Bilenky, 2003)


$$P^{\rm S}_{ee} = \left(1 - |U_{e3}|^2\right)^2\left[1 - 4\,\frac{|U_{e1}|^2|U_{e2}|^2}{\left(1 - |U_{e3}|^2\right)^2}\sin^2\Delta_{21}\right] + |U_{e3}|^4. \tag{13.38}$$

Experimentally, $P^{\rm S}_{ee}$ is significantly less than unity and also energy-dependent. On
combining the results of the CHOOZ experiment with the solar deficit, we conclude
that $|U_{e3}|^2 \ll 1$. Finally, atmospheric neutrinos oscillate with


$$P^{\rm A}_{\mu\tau} = 4|U_{\mu3}|^2|U_{\tau3}|^2\sin^2\Delta_{31}. \tag{13.39}$$


Note that oscillations of reactor and atmospheric neutrinos are triggered by the same
$\Delta m^2$. If for the latter we use $\Delta m^2 = 3\times10^{-3}$ eV$^2$, CHOOZ gives $|U_{e3}|^2 < 0.05$.
As a first approximation we can assume $|U_{e3}| \approx 0$, maximal atmospheric mixing,
and the LAMSW solution for solar mixing. With these results, we can approximate
the MNS matrix by


$$U_{\alpha i} \simeq \begin{pmatrix} c & s & 0\\ -s/\sqrt2 & c/\sqrt2 & 1/\sqrt2\\ s/\sqrt2 & -c/\sqrt2 & 1/\sqrt2\end{pmatrix} \quad\text{LAMSW}, \tag{13.40}$$


where $c = \cos\theta$, $s = \sin\theta$, and $s^2 \approx 0.3$. Note that there can be a phase in the
matrix, resulting in CP violation as in the quark sector. However, for $U_{e3} = 0$ the
theory is effectively a two-flavor theory. Only for a non-vanishing $U_{e3}$ element
could one establish CP violation in long-baseline experiments.
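A quick numerical check of the approximate mixing matrix is sketched below. It assumes that the second and third rows carry factors $1/\sqrt2$, as unitarity requires for maximal atmospheric mixing, and it evaluates the solar survival probability of Eq. (13.38). The function names are hypothetical.

```python
import math


def approximate_mns(sin2_theta_solar):
    """Approximate MNS matrix with U_e3 = 0 and maximal atmospheric mixing."""
    s = math.sqrt(sin2_theta_solar)
    c = math.sqrt(1.0 - sin2_theta_solar)
    r = 1.0 / math.sqrt(2.0)  # assumed 1/sqrt(2) factors, fixed by unitarity
    return [[c, s, 0.0], [-s * r, c * r, r], [s * r, -c * r, r]]


def max_unitarity_violation(u):
    """Largest deviation of U U^T from the identity (U is real here)."""
    worst = 0.0
    for a in range(3):
        for b in range(3):
            dot = sum(u[a][k] * u[b][k] for k in range(3))
            target = 1.0 if a == b else 0.0
            worst = max(worst, abs(dot - target))
    return worst


def solar_survival(u, delta21):
    """Electron-neutrino survival probability of Eq. (13.38)."""
    ue1, ue2, ue3 = (abs(x) ** 2 for x in u[0])
    rest = 1.0 - ue3
    return rest ** 2 * (1.0 - 4.0 * ue1 * ue2 / rest ** 2 * math.sin(delta21) ** 2) + ue3 ** 2


u = approximate_mns(0.3)
print(max_unitarity_violation(u))
print(solar_survival(u, math.pi / 2.0))  # minimum of the survival probability
```

With $U_{e3} = 0$ the survival probability reduces to the two-flavor form $1 - \sin^2(2\theta)\sin^2\Delta_{21}$.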

If neutrinos are Majorana particles (see below), then there are two additional 
phases in the mixing matrix. They can be shown to have no influence on oscillation 
physics and reveal their presence only in neutrinoless double beta decay, which we 
will discuss below. 


13.4 Majorana neutrinos 


The results of the last section give strong evidence in favor of massive neutrinos. 
The oscillations indicate that in the leptonic sector flavor number is not conserved, 
i.e. muon neutrinos can become tau neutrinos, etc. This is a change from one 
family to the next. So far there has been no discussion of a particle changing into 
its antiparticle. The mixing among the families is produced when we introduce a 
mass term of the form 


$$\mathcal{L}^{m} = \sum_{i,j} m_{ij}\,\bar\nu_i\nu_j + \text{h.c.}, \tag{13.41}$$

with $i$ and $j$ running over the families. For the electroweak theory, special attention
is required because the states are classified according to helicities, with left-handed
particles in doublets and right-handed particles in singlets of weak SU(2). The mass
term is now


$$\mathcal{L}^{\rm D} = \sum_{i,j} m_{ij}\,\bar\nu_{iL}N_{jR} + \text{h.c.} \tag{13.42}$$


It is known as the Dirac mass term and is produced by the Higgs mechanism, as
described at the beginning of this chapter.
There is also the possibility of introducing new mass terms,


$$\tfrac12 M_R\,\overline{N_R^c}\,N_R \quad\text{and}\quad \tfrac12 M_L\,\overline{\nu_L^c}\,\nu_L, \tag{13.43}$$


and their Hermitian conjugates, which are called Majorana mass terms. Obviously,
Majorana mass terms can include mixing among the generations by introducing
indices $i$ and $j$, as in Eq. (13.42). They also mix neutrinos with antineutrinos. The
new terms are Lorentz-invariant, and they carry lepton number two, thus introducing
violation of the lepton number. Again, special attention must be paid to the peculiar
property of the electroweak theory which classifies the states according to helicities.
This introduces special requirements on the terms which are allowed. For instance,
the Lorentz structure gives the identities $\bar N_R N_R = \bar\nu_L\nu_L = 0$ and
$\overline{N_R^c}N_R^c = \overline{\nu_L^c}\nu_L^c = 0$. A Majorana state is defined as one with equal components of particles and
antiparticles. It follows from the above identities that


$$\left(\bar N_R + \overline{N_R^c}\right)\left(N_R^c + N_R\right) = \bar N_R N_R^c + \overline{N_R^c}N_R,$$


which indicates that the mass terms introduced do indeed correspond to Majorana
particles.

Similarly, specific terms are permitted by the SU(2) symmetry. We can introduce
the term $M_R\overline{N_R^c}N_R$, since it is Lorentz-invariant and an SU(2) singlet. We cannot
introduce the term $M_L\overline{\nu_L^c}\nu_L$, because it is the direct product of a doublet with a
doublet, which decomposes into an SU(2) singlet plus a triplet. Their product with
the Higgs doublet cannot produce a singlet. Similarly, their product with a singlet
produces a singlet and a triplet. SU(2) symmetry dictates that $M_L = 0$, unless we
introduce triplet representations of Higgses. The possible mass terms are a Dirac
mass term as in Eq. (13.42) and, in addition, a Majorana term appearing as the first
term in Eq. (13.43). We sum up these terms in a matrix notation:


$$\mathcal{L}_{\rm mass} = \tfrac12\left(\bar\nu_L,\ \overline{N_R^c}\right)\begin{pmatrix} 0 & m_D\\ m_D & M_R\end{pmatrix}\begin{pmatrix}\nu_L^c\\ N_R\end{pmatrix} + \text{h.c.} \tag{13.44}$$
We can diagonalize this mass matrix with an orthogonal transformation. The
eigenvalues are

$$\lambda_{1,2} = \tfrac12\left(M_R \mp \sqrt{M_R^2 + 4m_D^2}\right), \tag{13.45}$$


which for $m_D/M_R \ll 1$ become


$$\lambda_1 \simeq -\frac{m_D^2}{M_R} \quad\text{and}\quad \lambda_2 \simeq M_R + \frac{m_D^2}{M_R}. \tag{13.46}$$

The case we studied is called the seesaw mechanism (Gell-Mann et al., 1979;
Yanagida, 1979; Minkowski, 1977) because when one mass is large the other is
small. Now, since the neutrino masses are very small relative to lepton and quark
masses, for instance $m_\nu \approx 10^{-2}$ eV, the seesaw mechanism supplies an explanation
provided that the Majorana mass, $M_R$, is large.
The corresponding wave functions, omitting a normalization constant, are


$$\psi_1 = \nu_L - \frac{m_D}{M_R}\,N_R, \qquad \psi_2 = \frac{m_D}{M_R}\,\nu_L + N_R, \tag{13.47}$$


indicating that $\psi_1$ consists primarily of the normal neutrino with a small admixture
of $N_R$. The other state $\psi_2$ is the heavy state with a large component $N_R$ and a small
admixture of $\nu_L$.

Identifying $\nu_L$ or $\psi_1$ with the standard-model neutrinos explains their lightness by
introducing a heavy scale. A hierarchical mass scheme for the neutrinos follows.
Neutrino masses $m_\nu \approx 10^{-2}$ eV require $M_R \approx 10^{11}$ GeV for a Dirac scale of 1 GeV.
This is a typical scale of grand unified theories, which is one of the reasons why 
seesaw models are very popular. 
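The seesaw relations can be illustrated with a short numerical sketch. It diagonalizes the $2\times2$ matrix with entries $(0, m_D; m_D, M_R)$ exactly, compares with the leading-order approximation, and inverts $m_\nu \simeq m_D^2/M_R$ for the heavy scale. The function names and the sample inputs are illustrative assumptions.

```python
import math


def seesaw_eigenvalues(m_d, m_r):
    """Exact eigenvalues (light, heavy) of the matrix [[0, m_d], [m_d, m_r]]."""
    root = math.sqrt(m_r ** 2 + 4.0 * m_d ** 2)
    heavy = 0.5 * (m_r + root)
    # The product of the eigenvalues equals the determinant, -m_d^2; using it
    # avoids the cancellation in 0.5 * (m_r - root) when m_d << m_r.
    light = -m_d ** 2 / heavy
    return light, heavy


def seesaw_approximation(m_d, m_r):
    """Leading-order eigenvalues for m_d / m_r << 1."""
    return -m_d ** 2 / m_r, m_r + m_d ** 2 / m_r


def required_majorana_scale_gev(m_nu_ev, m_d_gev):
    """Heavy scale M_R in GeV that gives a light mass m_nu for a Dirac mass m_D."""
    m_nu_gev = m_nu_ev * 1.0e-9
    return m_d_gev ** 2 / m_nu_gev


light, heavy = seesaw_eigenvalues(1.0, 100.0)
print(light, heavy, seesaw_approximation(1.0, 100.0))
print(required_majorana_scale_gev(1.0e-2, 1.0))
```

The negative sign of the light eigenvalue is not physical; only its magnitude enters as the neutrino mass.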


13.5 Neutrinoless double beta decay 


Much effort has been devoted to discovering the nature of the neutrino. The 
experimental results on oscillations are independent of the Majorana character 
of the neutrinos. They observe oscillation of flavors. Experiments also search for 
evidence of Majorana neutrinos. A reaction that is conceptually simple concerns 
the conversion 


$$\nu_e + N \to e^+ + \text{hadrons},$$


which has, unfortunately, a very small cross section. Theoretical estimates indicate
that the rate for a neutrino with an energy of 1 GeV is at least 18 orders of magnitude smaller
than the normal charged-current cross section. Experiments with neutrino beams
are far away from these limits.

Figure 13.1. Neutrinoless double beta decay.

A favorable reaction is


$$2n \to 2p + 2e^-,$$


which is known as neutrinoless double beta decay and is indicated as $0\nu\beta\beta$. The
process is shown in Fig. 13.1 where a nucleus with $Z$ protons and $A - Z$ neutrons
emits two $W^-$ bosons and decays into a nucleus with $Z + 2$ protons and $A - Z - 2$
neutrons. The two $W^-$ bosons interact with each other and convert into two
electrons. The lepton propagator is a Majorana neutrino containing both particles
and antiparticles. The amplitude for the process contains the leptonic tensor


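$$L_{\mu\beta}(0\nu\beta\beta) \propto U_{ei}\,\bar e\,\gamma_\mu\frac{1-\gamma_5}{2}\,\frac{\not{p} + m_i}{p^2 - m_i^2}\,\gamma_\beta\frac{1+\gamma_5}{2}\,e^c\,U_{ei} = U_{ei}^2\,\frac{m_i}{p^2 - m_i^2}\,\bar e\,\gamma_\mu\frac{1-\gamma_5}{2}\gamma_\beta\,e^c, \tag{13.48}$$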


where $p$ is the momentum of the propagating neutrino, $m_i$ its mass, and $e$ are the
electron fields. The element $U_{ei}$ gives the coupling of the electron to the $i$th neutrino
mass state. There is also a corresponding hadronic tensor $W^{\mu\beta}$, whose structure is
rather complicated. Its structure is unfortunately left out in most theoretical articles
and discussions in spite of the fact that it introduces considerable uncertainty in
the predictions. The decay rate of the process is obtained by first summing the
amplitudes over all intermediate states and then squaring the total amplitude to
arrive at


$$\Gamma(0\nu\beta\beta) = A\left|\sum_i U_{ei}^2\,m_i\right|^2, \tag{13.49}$$

where $A$ is a non-trivial factor representing nuclear matrix elements. The quantity


$$\langle m_{ee}\rangle = \left|\sum_i U_{ei}^2\,m_i\right|, \tag{13.50}$$
with the sum running over all neutrinos, is called the effective electron-neutrino
Majorana mass, or in short the effective neutrino mass. The neutrino masses in
the propagator are physical masses and are very small relative to $q^2$ involved in
nuclear decays. The masses $m_i$ and the mixing elements $U_{ei}$ also appear in the
oscillations of neutrinos. Unfortunately, the oscillation experiments measure
$\Delta m^2_{ij}$, i.e. mass differences, rather than absolute values for the masses. They also
measure the absolute squares of the $U_{ei}$, which in general are complex. Thus,
even after extracting mass differences and mixing angles from oscillation experiments, there
is still a large range of values possible for $\langle m_{ee}\rangle$ (Pascoli et al., 2005; Choubey
and Rodejohann, 2005). Processes depending on the effective Majorana mass have
branching ratios and cross sections that can be very small. The current limit for
$\langle m_{ee}\rangle$ comes from neutrinoless double beta decay of $^{76}$Ge and is approximately
0.2 eV. There is only one experiment (Klapdor-Kleingrothaus et al., 2004) in which
it has been claimed that the Majorana mass has been measured. The best value given
by the group is $\langle m_{ee}\rangle = 0.39$ eV, but the data analysis of this experiment has been
criticized (Aalseth et al., 2004). There are plans to build improved experiments.
This is a difficult but very interesting and exciting field, regarding which the reader
can consult other articles (Aalseth et al., 2004).
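The sensitivity of the effective mass to the unknown phases can be seen in a small sketch. It evaluates $|\sum_i U_{ei}^2 m_i|$ for complex $U_{ei}$; the function name and all numerical inputs (a common mass of 0.1 eV, $|U_{e2}|^2 = 0.3$, $U_{e3} = 0$) are illustrative assumptions, not measured values.

```python
import math


def effective_majorana_mass(u_e, masses_ev):
    """|sum_i U_ei^2 m_i| for a (possibly complex) first row u_e of the mixing matrix."""
    return abs(sum(complex(u) ** 2 * m for u, m in zip(u_e, masses_ev)))


s = math.sqrt(0.3)
c = math.sqrt(0.7)
masses = (0.1, 0.1, 0.1)  # assumed quasi-degenerate masses in eV

# Same oscillation parameters, two different relative Majorana phases.
aligned = effective_majorana_mass((c, s, 0.0), masses)
opposite = effective_majorana_mass((c, 1j * s, 0.0), masses)
print(aligned, opposite)
```

Both choices give identical oscillation probabilities, since these depend only on $|U_{ei}|^2$ and mass-squared differences, yet the effective mass differs by a factor of 2.5.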


Problems for Chapter 13 


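1. Show that the mixing angle $\theta$ in the matrix

$$\begin{pmatrix}\cos\theta & \sin\theta\\ -\sin\theta & \cos\theta\end{pmatrix},$$

which diagonalizes the symmetric matrix


$$M = \begin{pmatrix} a & b\\ b & d\end{pmatrix},$$


is given by


$$\tan(2\theta) = \frac{2b}{d - a},$$


and the eigenvalues by


$$E_{1,2} = \tfrac12\left[(a + d) \mp \sqrt{(a - d)^2 + 4b^2}\right].$$


What happens to the mixing angle and the difference of the eigenvalues of $M$ when one
adds a term proportional to the unit matrix to $M$?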

2. For three generations there are only two independent mass differences. For this reason
the observation of two oscillations, $e \to \mu$ and $\mu \to \tau$, determines the mass-squared
differences

$$\Delta m^2_S = \Delta m^2_{21},$$

$$\Delta m^2_A = \Delta m^2_{31} \simeq \Delta m^2_{32}.$$

For the CHOOZ experiment the distance is approximately a kilometer, so $\Delta m^2_{21}L/E$
is very small and does not contribute to the oscillation.

(i) Use Eq. (13.22) and approximations suggested by the description of this problem
to derive Eq. (13.39).

(ii) Use again Eq. (13.22), the same approximations, and the unitarity of the mixing
matrix to derive Eq. (13.37).

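3. The general Lagrangian with Dirac and Majorana mass terms is


$$-\mathcal{L} = \bar\psi\,\not{\partial}\,\psi + M_D\left[\bar\psi_L\psi_R + \text{h.c.}\right] + \frac{M_L}{2}\left[\overline{(\psi_L)^c}\,\psi_L + \text{h.c.}\right] + \frac{M_R}{2}\left[\overline{(\psi_R)^c}\,\psi_R + \text{h.c.}\right],$$


with $M_L$ and $M_R$ being Majorana mass terms for the left- and right-handed neutrinos.
(i) Show that it can be written in the form


$$-\mathcal{L} = \bar V\,\not{\partial}\,V + \bar V\,[M]\,V,$$

where $V$ is a column matrix,

$$V = \frac{1}{\sqrt2}\begin{pmatrix}\psi_L + (\psi_L)^c\\ \psi_R + (\psi_R)^c\end{pmatrix},$$

and $[M]$ is a symmetric matrix,

$$[M] = \begin{pmatrix} M_L & M_D\\ M_D & M_R\end{pmatrix}.$$

$[M]$ is the neutrino mass matrix. For $M_L = 0$ it reduces to the seesaw case.


(ii) Let $\psi_1$ and $\psi_2$ be the eigenvector fields of the mass matrix with eigenvalues $\lambda_1$ and
$\lambda_2$, respectively. Then show that $\mathcal{L}$ can be rewritten as


$$-\mathcal{L} = \bar\psi_1\,\not{\partial}\,\psi_1 + \bar\psi_2\,\not{\partial}\,\psi_2 + \lambda_1\bar\psi_1\psi_1 + \lambda_2\bar\psi_2\psi_2.$$

The physics content of the Lagrangian is now clear: it is the free Lagrangian for two
particles $\psi_1$ and $\psi_2$.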


4. Consider the Dirac Lagrangian and add the term


$$\phi\,\bar\psi\gamma_5\psi.$$


The Dirac equation now becomes


$$\left[i\not{\partial} - \gamma_5\phi - m\right]\psi(x) = 0.$$

When $\phi(x)$ is a function of $x_\mu$, the additional term represents the interaction of the fermion
with a scalar field. We may also consider $\phi$ to be a constant, i.e. independent of space
and time. Consider the latter case and search for plane-wave solutions


$$\psi(x) = u(p)\,e^{-ip\cdot x}.$$

(a) Which condition must $p_\mu$ satisfy in order for solutions to exist?
(b) Once $p_\mu$ satisfies the conditions, which are the linearly independent spinors?


(c) Find a chiral transformation $T = e^{i\alpha\gamma_5}$ so that the Dirac equation is brought into the
form $(i\not{\partial} - m)\psi(x) = 0$.


Comment This exercise shows that, whenever $\phi$ is a constant, the term $\phi\,\bar\psi(x)\gamma_5\psi(x)$
is a mass term. It also follows that the sign of the fermion mass can be changed by a
chiral transformation.
14
Heavy quarks


14.1 Introduction 


When quarks were introduced into physics, they were considered to be light, like 
the up, the down, and the strange quarks. Their bound states were light mesons, 
such as the $\pi$ and $K$ mesons. In fact, light states were interpreted as the Goldstone
bosons of the SU(3) symmetry with several of the aspects of Goldstone particles 
discussed in Chapter 5. 

The next quark is the charm quark, which was formulated in order to suppress 
flavor-changing-neutral couplings in the K mesons (Glashow et al., 1970) (GIM 
henceforth). Since the charm quark is much heavier than the proton, its existence 
gave rise to the possibility that additional heavy quarks may exist. The expectation 
was confirmed with the discoveries of the bottom and top quarks when accelerators 
of higher and higher energies began operating. 

The precise definition of quark masses is a delicate topic and for this reason 
we shall discuss some of the issues involved. Masses of fermions appear in the 
electroweak Lagrangian after the breaking of the symmetry, i.e. when the Higgs field 
acquires a vacuum expectation value. Masses for particles are measured through 
their interactions with an external field; for example, the bending of an electron 
beam in a magnetic field determines the ratio e/m. The interaction contains higher- 
order corrections, which must be included. For leptons the masses are defined as 
poles of the propagators. For quarks the situation is more complicated because they 
never appear as free particles, but are always confined within hadrons. The masses 
of quarks must include radiative corrections from the forces which confine them. On 
the energy scale of the heavy quarks the strong coupling constant is small enough 
to allow perturbative calculations. We define the running quark mass $m(p^2)$ as the
renormalized mass parameter in the quark propagator


$$S(p) = \frac{i}{\not{p} - m_0 - \Sigma(p) + i\epsilon} = \frac{iZ_3(p)}{\not{p} - m(p^2) + i\epsilon}. \tag{14.1}$$

In this expression $m_0$ is the bare mass and $\Sigma(p)$ is the fermion self-energy correction
obtained in perturbation theory, which contains infinite terms. The infinite
terms combine with $m_0$ to define the physical mass and define the wave-function
renormalization $Z_3(p)$. The specific method for eliminating (subtracting) the
infinity is scheme-dependent and brings into the definition of masses and coupling
constants the renormalization scale $\mu_0$, where the subtraction takes place. This
is very similar to the definition of the running coupling constant described in
Section 11.2.

So far we have defined quark masses as functions of the renormalization point
$\mu_0$ and the momentum $p$ at which they are measured. We need an additional
prescription for relating them to masses of hadrons or for describing how we estimate
reactions in which heavy quarks occur. There are several methods for extracting
quark masses from physical processes. One of them uses sum rules over two-point
functions (spectral functions), which are saturated by physical data and are
matched to theoretical expressions that include running quark masses (Narison,
2001; Manohar, 2000). Another method appeals to decays of mesons containing a
single heavy quark. Here one computes the decay in a two-fold expansion: in powers
of $\alpha_s(p)$ and a non-perturbative series in powers of $\Lambda_{\rm QCD}/m_Q$ (Manohar and
Wise, 2000). The masses extracted by the various methods are close to each other
and contain an explicit dependence on the momentum at which they are measured.
The spectrum that emerges for the new quarks is


$$m_c(m_c) = 1.15\text{-}1.35\ \text{GeV},$$
$$m_b(m_b) = 4.6\text{-}4.9\ \text{GeV}.$$


The mass of the top quark is determined in another way. The observed events
are attributed to the production of $t\bar t$ pairs and their subsequent decays into leptons
and hadron jets. The production of the pairs is computed in the parton model using
lowest-order QCD. The value for the top quark appears in the calculation and
the value which optimizes the fit is reported as the standard-model mass, giving
(CDF/D0, 2005)


$$m_t = 174.3 \pm 3.4\ \text{GeV}.$$


Besides the masses of the heavy quarks we need their couplings to charged and 
neutral currents. They are accurately determined by the gauge nature of the theory 
and experimental data. The couplings have already been discussed in Chapter 9. 
There are several other properties of heavy quarks that we take up in this chapter. 
First, the decays of states containing heavy quarks involve hadronic matrix elements 
that simplify considerably. In addition, when they appear as intermediate states in 
loop diagrams, they dominate the diagrams and produce new phenomena such as 
the mixing of states and CP asymmetries. Examples of the new phenomena are 
(i) $B^0$-$\bar B^0$ mixing dominated by box diagrams and (ii) some CP asymmetries dominated 
by penguin diagrams. Finally, heavy quarks may have couplings strong enough to 
induce bound states that imitate the Higgs particles. Some of these topics will be 
covered in this and the following chapters. 

It is still interesting to ask which quarks are considered to be heavy. A heavy 
state is one with 


$$m_Q \gg \Lambda_{\rm QCD},$$


where according to QCD the interparticle forces become weak. For instance, the 
inclusive semileptonic decays of pseudoscalar mesons containing a heavy quark 
must be given to a first approximation by the spectator model. The spectator model, 
improved by the momentum distribution of a b quark in a B meson, gives an accurate 
description of semileptonic B-meson decays. Thus B and higher meson states are 
considered to be heavy states. In fact, for hadronic states containing a heavy quark, 
a systematic expansion has been developed in inverse powers of the heavy-quark 
mass, which will be described in Sections 14.4 and 14.5. The top quark decays so 
fast that bound states do not have enough time to form. 

A crude test for the validity of the spectator model is provided by the ratio of 
lifetimes of charged and neutral mesons. For mesons with beauty quarks, 

$$\frac{\tau_{B^+}}{\tau_{B^0}} = 1.08 \pm 0.01,$$

which is to be compared with the ratio for mesons containing charmed quarks, 

$$\frac{\tau_{D^+}}{\tau_{D^0}} = 2.55 \pm 0.01.$$

The ratio for the B mesons suggests that the spectator model is applicable. The 
ratio for D mesons indicates that there are additional contributions. In fact, it was 
a surprise when the experimental colleagues established this large ratio. It is now 
generally believed that annihilation diagrams are important in D-meson decays. 
The simple picture that mesons consist of a quark and an antiquark is too naive, 
because the gluons in D mesons play a very active role in binding the quarks. We 
illustrate this fact by the following argument, where we set the CKM matrix equal 
to the unit matrix, which makes the $V_{cd}$ element zero. The spectator decay mode 
(see Section 14.2) 

$$c \to s + \bar d + u$$

is expected to give equal contributions to $D^0$ and $D^+$ decays, and its amplitude will 
be denoted by $A_{\rm sp}$. In addition, for the $D^0$ meson there is the annihilation diagram 
for the reaction 

$$c + \bar u \to s + \bar d + \text{gluons},$$

with a $W^+$ exchanged in the t-channel. Its amplitude will be denoted by $A_{\rm an}$. The 
decay width for $D^0$ receives contributions from both amplitudes, 

$$\Gamma_{D^0} \propto |A_{\rm sp}|^2 + |A_{\rm an}|^2,$$

and similarly for $D^+$, 

$$\Gamma_{D^+} \propto |A_{\rm sp}|^2.$$

The CKM-matrix elements are the same in all these diagrams, indicating that the 
lifetime $\tau_{D^+}$ is larger than $\tau_{D^0}$. Precise calculations for these and other decays of 
charm mesons are difficult, but there are phenomenological models that give a 
consistent picture for several decay channels (Bander et al., 1980). 

This chapter is long and contains various topics. To help the reader I outline 
its contents. The following three sections describe decays of heavy quarks in phe- 
nomenological models. Some of these models have been very popular. Sections 
14.4 and 14.5 are devoted to the heavy-quark effective theory (HQET), which is a 
systematic expansion of amplitudes in inverse powers of the heavy-quark mass. The 
top quark is very heavy and has special properties. For this reason a special section 
is devoted to it. Finally, heavy quarks play an important role in loop diagrams, with 
a simple example being provided by box diagrams, which are ultraviolet-finite. 
An introduction to the computational methods for box diagrams is included in 
Section 14.7. 


14.2 Semileptonic and inclusive B-meson decays 
14.2.1 The spectator model 


When a heavy quark, generically denoted by Q, decays inside a hadron, it does 
so without disturbing the surrounding field produced by other quarks, antiquarks, 
and gluons. In the B mesons, for instance, the b quark decays, leaving the light 
antiquark $\bar u$, $\bar d$, or $\bar s$ (hereafter generically denoted as q) undisturbed (as a spectator). 
Thus we expect the decay of the B meson to be given by the free decay of the b quark 
to a zeroth-order approximation. This result must be improved for the bound-state 
effects of the meson, as will be described in this section (Altarelli et al., 1982). 

In the spectator model, the light quark moves in the field of the heavy quark, 
and between them they share all the energy and momentum of the meson. The 
spectator quark q has a definite mass $m_q$ and a four-momentum $p_q = (E_q, \mathbf{p})$. In 
the rest frame of the meson the b quark moves with a virtual mass $W$ and a 
four-momentum $p_b = (E_b, -\mathbf{p})$, where 

$$E_q + E_b = M_B, \qquad W^2 = M_B^2 + m_q^2 - 2M_B\sqrt{\mathbf{p}^2 + m_q^2}, \tag{14.2}$$

and $M_B$ is the mass of the B meson. The decay is now given by the point-like decay 
of the b quark averaged over its momentum inside the meson (Fig. 14.1). 

Figure 14.1. B-meson decay. 

In the model of Altarelli et al. (1982), henceforth called the ACCMM model, 
the averaging over the momentum of the b quark is done by introducing a Gaussian 
distribution function, 

$$\phi(|\mathbf{p}|) = \frac{4}{\sqrt{\pi}\,p_F^3}\exp\left(-|\mathbf{p}|^2/p_F^2\right), \tag{14.3}$$

where $p_F$, the Fermi momentum, is a free parameter to be adjusted by comparing the 
theoretical prediction with the experimental spectrum. Its value is of the order of a 
few hundred MeV at most, so large-$|\mathbf{p}|$ configurations are exponentially suppressed. 
With this distribution the b quark remains close to its mass shell. 

To calculate the electron spectrum from the semileptonic decay $b \to u(c)\,e\,\bar\nu_e$, 
one follows three steps. 


1. One calculates the differential decay width for a point quark in its rest frame. This is 
exactly analogous to the muon decay and the result is 

$$\frac{d\Gamma^0}{dE} = \frac{G^2 W^4}{48\pi^3}\,|V_{ub}|^2\,x^2(3 - 2x). \tag{14.4}$$

Here, $E$ is the electron energy, $V_{ub}$ (or $V_{cb}$ for $b \to c$) is the CKM-matrix element, and $x = 2E/W$. 
Note that we have neglected the masses of all final-state particles, which is surely not 
justified for a $b \to c$ transition; the relevant mass dependences are given in the ACCMM 
article. 

2. In addition, one may incorporate the QCD corrections to this tree-level decay distribution 
by multiplying $d\Gamma^0/dE$ by an overall factor that is less than unity and depends upon 
$x$. This changes the shape of the spectrum near its endpoint. This will not be discussed 
here, but is described briefly in Problem 2. 

3. The decaying quark is not at rest but moves with a momentum $|\mathbf{p}|$ inside the meson. We 
can account for this motion by writing the decay rate in a Lorentz-invariant form, which 
will be valid in any frame. To this end, we write first 


(Ey, Ee — ppe cos 0). (14.5) 


The first two equations are computed in the rest frame of the b quark and the third one 
in a frame where the b quark moves. Here $E_b^2 = W^2 + p^2$, $p_e^2 = E_e^2 - m_e^2$, and $\theta$ is the 
angle between the momentum of the b quark and the direction of the electron. The final 
decay rate is obtained by averaging over $p$ and $\theta$ using the distribution function 


eee lV, pf” W*x7?(3 — 2x)6(p)p? dp d cos 8 (14.6) 
= a x — 2x cos ð; : 
dE, 48m J, sa 


here $x$ depends on $p$ and the range of integration over $\theta$ is from 0 to $\pi$. 
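
As a quick numerical cross-check of step 1, the following sketch assumes that the rest-frame spectrum has the muon-decay form $d\Gamma^0/dE = G^2W^4|V|^2x^2(3-2x)/(48\pi^3)$ with $x = 2E/W$, and integrates it over $0 \le E \le W/2$. The integral should reproduce the familiar total rate $G^2W^5|V|^2/(192\pi^3)$. The numerical value of the Fermi constant, the illustrative mass $W = 4.8$ GeV, and all function names are assumptions made for this example only.

```python
import math

G_FERMI = 1.16637e-5  # Fermi constant in GeV^-2 (assumed numerical value)


def rest_frame_spectrum(E, W, V=1.0, G=G_FERMI):
    """dGamma0/dE for a point quark of mass W at rest (massless final states)."""
    x = 2.0 * E / W
    if x < 0.0 or x > 1.0:
        return 0.0  # outside the kinematic range 0 <= E <= W/2
    return G**2 * W**4 / (48.0 * math.pi**3) * V**2 * x**2 * (3.0 - 2.0 * x)


def simpson(f, a, b, n=200):
    """Composite Simpson rule with an even number n of sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0


def total_width(W, V=1.0, G=G_FERMI):
    """Integrate the spectrum over the electron energy."""
    return simpson(lambda E: rest_frame_spectrum(E, W, V, G), 0.0, W / 2.0)


W = 4.8  # GeV, illustrative value for the decaying quark mass (assumption)
numeric = total_width(W)
analytic = G_FERMI**2 * W**5 / (192.0 * math.pi**3)
print(numeric, analytic)
```

Because the spectrum is a cubic polynomial in $E$, Simpson's rule integrates it exactly up to rounding, so the two printed numbers should agree to machine precision.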


This method of averaging over the Fermi motion respects Lorentz invariance 
and takes into account the phase space. It is somewhat crude, in the sense that the 
energy and momentum of the meson are supposed to be distributed between the 
two constituent quarks only and the distribution formula is valid only when |p| is 
rather small. In spite of its crudeness, the model describes accurately the electron 
spectrum observed in the experiments. It indicates that for B and heavier mesons the 
spectator model is a reasonable zeroth-order approximation, requiring additional 
corrections from the interaction of the heavy quark with its surrounding field. We 
shall discuss some improvements in this chapter. 
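
The ingredients of the Fermi-motion average can be explored numerically. The sketch below assumes the virtual mass $W^2 = M_B^2 + m_q^2 - 2M_B\sqrt{p^2 + m_q^2}$ and a Gaussian distribution normalized as $\int_0^\infty \phi(p)\,p^2\,dp = 1$, i.e. $\phi(p) = 4\exp(-p^2/p_F^2)/(\sqrt{\pi}\,p_F^3)$. The values chosen for $M_B$, $m_q$, and $p_F$ are illustrative assumptions, not fitted parameters.

```python
import math


def virtual_mass_sq(p, M_B, m_q):
    """W^2 of the b quark when the spectator carries momentum p."""
    return M_B**2 + m_q**2 - 2.0 * M_B * math.sqrt(p * p + m_q * m_q)


def phi(p, p_F):
    """Gaussian momentum distribution, normalized with measure p^2 dp."""
    return 4.0 / (math.sqrt(math.pi) * p_F**3) * math.exp(-((p / p_F) ** 2))


def simpson(f, a, b, n=2000):
    """Composite Simpson rule with an even number n of sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0


# Illustrative inputs in GeV (assumptions for this sketch only).
M_B, m_q, p_F = 5.279, 0.15, 0.30

# Largest spectator momentum for which W^2 >= 0.
p_max = (M_B**2 - m_q**2) / (2.0 * M_B)

# Normalization of phi; the Gaussian tail beyond 10 p_F is negligible.
norm = simpson(lambda p: phi(p, p_F) * p * p, 0.0, 10.0 * p_F)

# Average virtual mass of the b quark inside the meson.
mean_W = simpson(
    lambda p: phi(p, p_F) * p * p
    * math.sqrt(max(virtual_mass_sq(p, M_B, m_q), 0.0)),
    0.0,
    p_max,
)
print(norm, p_max, mean_W)
```

For a Fermi momentum of a few hundred MeV the average $W$ stays within a few hundred MeV of $M_B - m_q$, which illustrates the statement that the b quark remains close to its mass shell.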


14.2.2 The parton model 


The semileptonic B-meson decays can be presented by the diagram in Fig. 14.2, 
where X can be a specific final state or the incoherent sum over various final states. 
The drawing represents the square of the amplitude and looks very much like the 
diagram for deep inelastic scattering. This suggests that a formally similar analysis 
for the B-meson inclusive decays is possible. There are two differences, however; 
the initial B meson is heavy and the current is time-like, with its variables determined 
by the initial and final hadronic states. Consequently, we use the same formalism 
but the structure function is now different, giving the probability of finding a b 
quark in a fast-moving B meson. 

In a standard analysis the square of the matrix element is given as the product of 
a leptonic and a hadronic tensor, 


$$|\mathcal{M}|^2 = L^{\mu\nu}W_{\mu\nu}. \tag{14.7}$$

Figure 14.2. Semileptonic B decay in the parton model. 


The hadronic tensor can be written as 


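$$W_{\mu\nu}(q^2, q\cdot P) = \frac{1}{2\pi}\int d^4x\,e^{iq\cdot x}\,\langle B|[J_\mu(x), J_\nu^\dagger(0)]|B\rangle
= -g_{\mu\nu}W_1 + \frac{P_\mu P_\nu}{m_B^2}W_2 + i\epsilon_{\mu\nu\alpha\beta}\frac{P^\alpha q^\beta}{m_B^2}W_3 + \cdots, \tag{14.8}$$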


with the notation being analogous to that introduced for neutrino reactions. After 
analyzing the kinematics of the decay, it was shown that a large region of phase 
space is dominated by the singular behavior of the commutator on the light cone. 
This allows us to express the decay in terms of a quark distribution function f(x), 
which denotes the probability of finding a b quark in the B meson. Dominance of 
the light cone demands that the analysis of the decay be carried out in an infinite- 
momentum frame. 

We visualize the B meson moving in an infinite-momentum frame, where the 
motion of the constituents slows down relative to the time of the decay. In this frame 
the b quark decays, leaving the rest of the B meson undisturbed. The decay is in 
general given by 


$$\frac{d\Gamma}{dE_e\,dq_0\,dq^2} = \frac{G^2|V_{ub}|^2}{32\pi^3 m_B}\left[2q^2W_1 + (4E_eq_0 - 4E_e^2 - q^2)W_2 - 2q^2(2E_e - q_0)\frac{W_3}{m_B}\right]. \tag{14.9}$$

The hadronic tensor for point-like interaction is 

$$W_b^{\mu\nu}(q, p_b) = \frac{1}{2}\,\mathrm{Tr}\left[\slashed{p}_u\gamma^\mu(1 - \gamma_5)\slashed{p}_b\gamma^\nu(1 - \gamma_5)\right]\delta\left[(p_b - q)^2\right], \tag{14.10}$$

with $p_u$ and $p_b$ the four-momenta of the u and b quarks, respectively. The b quark 
carries a fraction $x$ of the B meson’s momentum and has the momentum distribution 
function $f(x)$. We arrive at the B-meson decay by substituting 

$$p_b = xP \quad\text{and}\quad p_u = p_b - q, \tag{14.11}$$

then multiplying $W_b^{\mu\nu}$ by $f(x)$ and integrating over $x$, 

$$W^{\mu\nu} = \int W_b^{\mu\nu}(q^2, x\,q\cdot P)\,f(x)\,\delta(x^2P^2 - 2x\,q\cdot P + q^2)\,dx, \tag{14.12}$$

where the δ-function of Eq. (14.10) is displayed explicitly for $p_b = xP$. 
The δ-function has two roots $x_\pm = (q_0 \pm |\mathbf{q}|)/M_B$. The smaller root $x_-$ corresponds 
to diagrams with particles moving backwards in time and will be neglected because 
it is numerically small. The structure functions are given as (Bareiss and Paschos, 
1989) 

$$W_1 = 2x_+f(x_+), \qquad W_2 = \frac{4M_B}{|\mathbf{q}|}\,x_+^2f(x_+), \qquad W_3 = -\frac{2M_B}{|\mathbf{q}|}\,x_+f(x_+). \tag{14.13}$$

Direct substitution leads to the final result 

$$\frac{d\Gamma}{dE_e\,dq_0\,dq^2} = \frac{G^2|V_{ub}|^2}{4\pi^3 m_B}\,\frac{x_+f(x_+)}{\sqrt{q_0^2 - q^2}}\,(q_0 - E_e)(2x_+m_BE_e - q^2), \tag{14.14}$$

with $x_+ = (q_0 + |\mathbf{q}|)/M_B$. 

It is hard to measure the triple differential decay and several integrations must be 
carried out. We define the limits of the integrations. For fixed $q^2$ we can substitute 
$q_0$ by the hadronic mass squared $s = m_X^2$, 

$$s = m_B^2 + q^2 - 2m_Bq_0 = m_B^2 + q^2 - 2m_BE_e - 2m_BE_\nu. \tag{14.15}$$

We can replace $E_\nu$ by $E_\nu = q^2/[2E_e(1 - \cos\theta)]$ and obtain the upper bound of $s$ 
at $\cos\theta = -1$: 

$$m_\pi^2 \le s \le (m_B - 2E_e)\left(m_B - \frac{q^2}{2E_e}\right). \tag{14.16}$$

The other two variables are bounded in the regions 

$$0 \le q^2 \le 2m_BE_e, \tag{14.17}$$

$$0 \le E_e \le \frac{m_B}{2}. \tag{14.18}$$
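
The integration limits can be checked with a few lines of code. The sketch below assumes massless leptons, so that $q^2 = 2E_eE_\nu(1 - \cos\theta)$, and verifies that inserting the back-to-back configuration $\cos\theta = -1$ into $s = m_B^2 + q^2 - 2m_B(E_e + E_\nu)$ reproduces the factorized upper bound $(m_B - 2E_e)(m_B - q^2/2E_e)$. The meson masses and the sample point are illustrative assumptions, and the function names are invented for this example.

```python
def hadronic_mass_sq(m_B, q2, q0):
    """s = (P - q)^2 in the rest frame of the B meson."""
    return m_B**2 + q2 - 2.0 * m_B * q0


def neutrino_energy(q2, E_e, cos_theta):
    """E_nu from q^2 = 2 E_e E_nu (1 - cos theta) for massless leptons."""
    return q2 / (2.0 * E_e * (1.0 - cos_theta))


def s_upper_bound(m_B, E_e, q2):
    """Upper limit of s, reached for cos(theta) = -1."""
    return (m_B - 2.0 * E_e) * (m_B - q2 / (2.0 * E_e))


def in_phase_space(E_e, q2, s, m_B, m_min):
    """True if (E_e, q2, s) lies inside the allowed region."""
    if not 0.0 < E_e <= m_B / 2.0:
        return False
    if not 0.0 <= q2 <= 2.0 * m_B * E_e:
        return False
    return m_min**2 <= s <= s_upper_bound(m_B, E_e, q2)


M_B, M_PI = 5.279, 0.1396  # GeV, assumed meson masses
E_e, q2 = 1.5, 6.0         # GeV and GeV^2, an arbitrary sample point

E_nu = neutrino_energy(q2, E_e, -1.0)
s_back_to_back = hadronic_mass_sq(M_B, q2, E_e + E_nu)
print(s_back_to_back, s_upper_bound(M_B, E_e, q2))
```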


The triple differential rate can be used to extract the distribution function directly, 
but decay rates in several variables are not available yet. Instead one adopts an 
Ansatz for the distribution function. Here we are guided by physical intuition and 
the experience gained from the fragmentation functions. In the boosted frame, we 
expect the heavy quark to carry most of the momentum of the meson, which means 
that the distribution function is peaked at x ~ 1. Thus a function with a peak in 
the high-x region and a small width should be sufficient. A function with two 
parameters a and b that satisfies the above criteria is 


$$f(x) = N\,\frac{x(1 - x)}{(x - b)^2 + a^2}, \tag{14.19}$$

with $N$ a normalization constant. Integration over the variables for $a = 0.0118$, 
$b = 0.931$ produces the electron spectrum shown in Fig. 14.3. A similar 
curve is obtained in the ACCMM model. The data points are from Barish et al. 
(1996). 

Figure 14.3. The predicted electron-energy spectrum compared with data. 
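
To get a feeling for such an Ansatz, one can normalize it and locate its peak numerically. The sketch below assumes the two-parameter form $f(x) = N\,x(1-x)/[(x-b)^2 + a^2]$ on $0 \le x \le 1$, normalized to unit integral, with the parameter values quoted in the text. Both the functional form and the normalization convention should be read as assumptions of this example.

```python
def shape(x, a, b):
    """Unnormalized distribution: Lorentzian peak at b, width a, times x(1-x)."""
    return x * (1.0 - x) / ((x - b) ** 2 + a**2)


def simpson(f, lo, hi, n=20000):
    """Composite Simpson rule with an even number n of sub-intervals."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3.0


A, B = 0.0118, 0.931  # parameter values quoted in the text

N = 1.0 / simpson(lambda x: shape(x, A, B), 0.0, 1.0)


def f(x):
    """Normalized distribution function."""
    return N * shape(x, A, B)


# Locate the maximum on a fine grid and compute the mean momentum fraction.
grid = [i / 100000.0 for i in range(100001)]
x_peak = max(grid, key=f)
mean_x = simpson(lambda x: x * f(x), 0.0, 1.0)
print(N, x_peak, mean_x)
```

The peak sits very close to $x = b$, shifted slightly downward by the factor $x(1-x)$, in line with the expectation that the heavy quark carries most of the meson's momentum.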

It is desirable to calculate the distribution function or determine its parameters 
from basic principles. Attempts in this direction have been made in the heavy-quark 
effective theory. 


14.3 Exclusive semileptonic decays 


Of special interest are decays in which the initial and final mesons are specific 
hadrons. In this case analysis in the quark-decay picture is not applicable. Let us 
concentrate on decays of a heavy B or D meson to a meson $X_i$ and a lepton pair: 

$$M \to X_i\,e\,\nu, \tag{14.20}$$

where $M$ is the initial heavy meson. The experiments measure the energy spectrum 
of the electron, $(1/\Gamma)\,d\Gamma/dE_e$, and, for comparison with the theoretical model, we 
calculate the invariant amplitude for the process. It is again a product of the known 
leptonic weak current $L^\mu$ and the hadronic matrix element $H^\mu$: 

$$\mathcal{M} = \frac{G}{\sqrt{2}}\,L^\mu H_\mu, \quad\text{with}\quad H^\mu = \langle X_i|j^\mu|M\rangle. \tag{14.21}$$

The explicit structure of the meson current depends on the definite properties of 
the final meson under Lorentz transformations. With such information it is possible 
to derive the most general form-factor decomposition of the matrix element. As an 
example we present here the decay to a pseudoscalar state, i.e. $X_i = P(0^-)$: 

$$\langle P|j_\mu|M\rangle = \frac{m_M^2 - m_P^2}{q^2}\,q_\mu F_0(q^2) + \left[(k_M + k_P)_\mu - \frac{m_M^2 - m_P^2}{q^2}\,q_\mu\right]F_1(q^2), \tag{14.22}$$

with $q_\mu = (k_M - k_P)_\mu$ and $F_0(0) = F_1(0)$. $F_0(q^2)$ and $F_1(q^2)$ denote the longitudinal 
and transverse form factors, respectively. The form factors are unknown 
parameters, which have to be estimated theoretically and then compared with the 
experiments. On the theoretical side, the most famous parametrization appears in 
the BSW model (Wirbel et al., 1985), which makes the following Ansatz for the $q^2$ 
dependence: 

$$F_i(q^2) = \frac{F_i(0)}{1 - q^2/m_i^2}. \tag{14.23}$$

The pole masses $m_i$ are estimated numerically. Finally, to predict an energy spectrum 
for the lepton produced in the decay of Eq. (14.20), an estimate for the form 
factors at $q^2 = 0$ is required. In the BSW approach it is given by the overlap of the 
initial and final wave functions of the mesons. With this information, the phase-space 
integrals can be performed and many differential decay rates calculated 
(Wirbel et al., 1985). 
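
The behavior of a single-pole form factor over the physical region is easy to tabulate. The sketch below assumes the nearest-pole form $F(q^2) = F(0)/(1 - q^2/m_{\rm pole}^2)$ and the kinematic limit $q^2_{\max} = (m_M - m_P)^2$. The numerical inputs are illustrative placeholders chosen by us, not results of the BSW model.

```python
def pole_form_factor(q2, F_at_zero, m_pole):
    """Single-pole Ansatz for the q^2 dependence of a form factor."""
    if q2 >= m_pole**2:
        raise ValueError("q2 must lie below the pole at m_pole^2")
    return F_at_zero / (1.0 - q2 / m_pole**2)


def q2_max(m_M, m_P):
    """Largest momentum transfer, reached when the final meson is at rest."""
    return (m_M - m_P) ** 2


# Placeholder inputs in GeV (assumptions for illustration only).
m_M, m_P = 1.865, 0.494
F_at_zero, m_pole = 0.76, 2.11

q2_end = q2_max(m_M, m_P)
for step in range(5):
    q2 = q2_end * step / 4.0
    print(round(q2, 3), round(pole_form_factor(q2, F_at_zero, m_pole), 3))
```

Since the pole lies above the physical region, the form factor rises monotonically from $F(0)$ toward the zero-recoil end of the spectrum.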


14.4 Heavy-quark effective theory 


The heavy-quark effective theory (HQET) is a systematic method for describing 
particles containing a heavy quark (Manohar and Wise, 2000). The HQET is based 
on the fact that QCD is flavor-blind and a hadron containing a heavy quark is 
very unlikely to have excitations containing antiquark degrees of freedom. Thus it 
suffices to work with the spinor of the heavy quark and view the light quark with its 
surrounding gluonic field as a composite system. This picture is an improvement 
over the spectator model and supplies a method for calculating the distribution 
function of the heavy quark in the meson. For mesons made up of a heavy quark 
Q and a light antiquark, the heavy quark is essentially on-shell and therefore static 
when the meson is at rest. This is analogous to atoms, in which the nucleus is 
stationary and the electrons move in the static field of the nucleus. We will see that, 
to zeroth order, properties of the meson do not depend on the mass, spin, or flavor 
of the heavy quark, just as chemical properties of the atom do not depend on the 
particular isotope. 

We shall demonstrate that the HQET is a limiting case of QCD where $m_Q \to \infty$ 
with the four-velocity of Q, $v_\mu$, held fixed. In this limit the Lagrangian simplifies 
considerably, giving a static term and calculable corrections expressed in inverse 
powers of the heavy-quark mass. 

The four-momentum of the heavy quark can be expressed by 

$$p_\mu = m_Qv_\mu + k_\mu, \tag{14.24}$$

with $v_\mu$ its velocity normalized to unity, $v_\mu v^\mu = 1$. The correction term $k_\mu$ is small 
relative to $m_Qv_\mu$. The heavy-quark field has an energy dependence close to that 
of a free particle and a spinor $h_v$ with positive energy. Since the heavy quark is 
bound, it may be influenced by the cloud of quark-antiquark pairs. These degrees 
of freedom are represented in HQET by two spinors, which are eigenfunctions of 
the velocity operator. To be specific, 

$$Q(x) = e^{-im_Qv\cdot x}\left[h_v(x) + l_v(x)\right], \tag{14.25}$$

with 

$$\slashed{v}h_v = h_v, \qquad \slashed{v}l_v = -l_v. \tag{14.26}$$

Factoring out the phase is a redefinition of the field, but the decomposition into the 
components $h_v$ and $l_v$ is an approximation in terms of positive and negative velocity 
fields. 

The results can be re-expressed in terms of velocity projection operators 

$$P_\pm = \tfrac{1}{2}(1 \pm \slashed{v}), \tag{14.27}$$

which project out the positive velocity fields, 

$$P_+Q = e^{-im_Qv\cdot x}h_v, \tag{14.28}$$

and the negative velocity fields, 

$$P_-Q = e^{-im_Qv\cdot x}l_v. \tag{14.29}$$

The overall phase varies rapidly and among the spinors we expect $h_v$ to be dominant. 
The heavy quark interacts with the gluonic field and satisfies a Dirac Lagrangian 

$$\mathcal{L} = \bar{Q}(x)\left[i\slashed{D} - m_Q\right]Q(x), \tag{14.30}$$

where $D_\mu = \partial_\mu + ig_sA_\mu^a\lambda^a/2$, with $A_\mu^a(x)$ the field of the gluons. We are interested 
in approximate solutions of the Lagrangian in which the heavy-quark field 
$h_v$ is slightly perturbed by the light degrees of freedom. To simplify the algebra we 
mention the following identities and drop the subscript $v$, i.e. setting $h_v = h$ and $l_v = l$: 

$$P_+\gamma_\mu P_+ = P_+(P_-\gamma_\mu + v_\mu) = v_\mu P_+, \qquad P_-\gamma_\mu P_- = P_-(P_+\gamma_\mu - v_\mu) = -v_\mu P_-, \tag{14.31}$$

$$\slashed{\partial}Q(x) = -im_Q\slashed{v}Q + e^{-im_Qv\cdot x}\,\slashed{\partial}(h + l), \tag{14.32}$$

with $\bar{h}l = 0$ and $\bar{h}\slashed{v}l = \bar{l}\slashed{v}h = 0$. With the help of the identities, we rewrite $\mathcal{L}$ in the 
following way: 

$$\mathcal{L} = \bar{h}i\slashed{D}h + \bar{l}(i\slashed{D} - 2m_Q)l + \bar{h}i\slashed{D}l + \bar{l}i\slashed{D}h
= i\bar{h}v\cdot Dh - \bar{l}(iv\cdot D + 2m_Q)l + \bar{h}i\slashed{D}l + \bar{l}i\slashed{D}h. \tag{14.33}$$

In the last two terms a final simplification is possible, which allows one to replace 
$D_\mu$ by its transverse component $D_{\perp\mu} = D_\mu - (D\cdot v)v_\mu$, because $\bar{h}\slashed{v}l = \bar{l}\slashed{v}h = 0$. 
Several authors use this additional simplification. The Euler-Lagrange equation 
for the field $l$ gives 

$$(iv\cdot D + 2m_Q)\,l = i\slashed{D}h \;\Rightarrow\; l = \frac{1}{iv\cdot D + 2m_Q}\,i\slashed{D}h. \tag{14.34}$$

To sum up, we have taken out the dependence on large momenta by factoring out 
the term $\exp(-im_Qv\cdot x)$. The remaining terms, except for the covariant derivative 
acting on $h$, contain small momenta, as is evident from the fact that $l(x)$ is indeed 
the smaller of the two velocity spinors. 

In this limit of the theory, we give rules for the propagator 

$$\frac{\slashed{p} + m_Q}{p^2 - m_Q^2} = \frac{m_Q\slashed{v} + m_Q + \slashed{k}}{2m_Qv\cdot k + k^2} \to \frac{1 + \slashed{v}}{2}\,\frac{1}{v\cdot k} + O\!\left(\frac{1}{m_Q}\right) \tag{14.35}$$

and for the vertex describing the coupling of gluons to the heavy quark, 

$$ig\,\bar{h}\gamma_\mu h = ig\,v_\mu\bar{h}h. \tag{14.36}$$

Calculations with $h$ alone reproduce the free-quark model. The interactions are 
improved by including $1/m_Q$ terms. 

In the limit of $m_Q \to \infty$ the quark field approaches the heavy-quark field, 

$$Q(x) \to e^{-im_Qv\cdot x}P_+h. \tag{14.37}$$

On substituting this field into the Lagrangian (14.30) or from Eq. (14.33), we obtain 

$$\mathcal{L} \to i\bar{h}v\cdot Dh \equiv \mathcal{L}_0. \tag{14.38}$$

The result shows explicitly that in the limit $m_Q \to \infty$ the interaction between the 
gluonic field and the heavy quark is independent of the spin and mass of the heavy 
quark. There are several consequences of this property. 

As mentioned already, the meson is viewed as composed of a heavy quark and a 
light system consisting of the light quark and its surrounding gluonic field. Several 
properties should be independent of the spin of the heavy quark. Let us denote the 
spin of the light system by $\mathbf{S}_\ell$ and the spin of the heavy quark by $\mathbf{S}_Q$, with $S_Q = \tfrac12$. The two 
systems may still have relative angular momentum $\mathbf{L}$. The total angular momentum, 
$\mathbf{J}$, for the meson is a sum of spins 

$$\mathbf{S} = \mathbf{S}_Q + \mathbf{S}_\ell \tag{14.39}$$

plus the relative angular momentum $\mathbf{L}$, 

$$\mathbf{J} = \mathbf{S} + \mathbf{L}.$$

For a meson with $S_\ell = \tfrac12$ and $L = 0$ 

$$S = 0,\ 1$$

and the total angular momentum $J = 0$ or 1. For $Q = b$ the two states with $L = 0$ 
are the $B$ and $B^*$ mesons. The HQET predicts that the two states have properties that 
are independent of $\mathbf{S}_Q$ and therefore degenerate. The difference between the masses 
of the pseudoscalar and vector mesons is indeed small relative to their sum. The 
same is experimentally true for the charm states $D$ and $D^*$. 

Even though the mass differences are small, they are not zero. To first approximation 
the spin symmetry is realized, but the small differences suggest that 
we must formulate the effective Lagrangian more precisely by keeping the $1/m_Q$ 
terms which were ignored. This will be discussed in the next section. 

Higher-order corrections are obtained by including the light-quark degrees of freedom. 
This complicates the algebra somewhat, but leads to correction terms that are 
simple to describe. In the following we give a derivation. However, the reader who 
finds it complicated may proceed directly to Eq. (14.43). 

To include the light-quark field, we substitute $l(x)$ using Eq. (14.34) together 
with a similar expression for $\bar{l}(x)$. In this way we obtain an improved interaction, 

$$\mathcal{L} = \mathcal{L}_0 + \mathcal{L}_1 = \bar{h}\left(iv\cdot D + i\slashed{D}\,\frac{1}{2m_Q + iv\cdot D}\,i\slashed{D}\right)h
= i\bar{h}v\cdot Dh + \frac{1}{2m_Q}\,\bar{h}\,i\slashed{D}\,i\slashed{D}\,h + O\!\left(\frac{1}{m_Q^2}\right), \tag{14.40}$$

with the first term being $\mathcal{L}_0$ and the rest the improvement. The new term has two 
covariant derivatives next to each other and can be simplified. We use the identity 

$$\slashed{D}\slashed{D} = \gamma^\mu\gamma^\nu D_\mu D_\nu = \tfrac12\{\gamma^\mu, \gamma^\nu\}D_\mu D_\nu + \tfrac12[\gamma^\mu, \gamma^\nu]D_\mu D_\nu = D^2 + \tfrac14[\gamma^\mu, \gamma^\nu][D_\mu, D_\nu] \tag{14.41}$$

together with 

$$\sigma^{\mu\nu} = \frac{i}{2}[\gamma^\mu, \gamma^\nu], \qquad ig_sG_{\mu\nu} = [D_\mu, D_\nu], \quad G_{\mu\nu} \equiv G^a_{\mu\nu}\frac{\lambda^a}{2}, \tag{14.42}$$

in order to obtain the $1/m_Q$ term of the Lagrangian: 

$$\mathcal{L}_1 = -\bar{h}\,\frac{D^2}{2m_Q}\,h - g_s\,\bar{h}\,\frac{\sigma^{\mu\nu}G_{\mu\nu}}{4m_Q}\,h. \tag{14.43}$$

A few remarks are now in order. The first term, $D^2/(2m_Q)$, is the average kinetic 
energy of the heavy quark. The second term couples the spin of the heavy quark to 
the gluo-magnetic field surrounding the quarks. The theory succeeded in including 
the interaction of the heavy quark with the surrounding degrees of freedom. It also 
has the advantage that it is precise enough to allow systematic studies. It has been 
successful in accounting for mass differences, decay rates etc., which we cover in 
the next section. 


14.5.1 Applications 
Mass relations 


To leading order the HQET Lagrangian does not depend on the spin or flavor of 
the heavy quark. The first term in Eq. (14.43) breaks the flavor symmetry and the 
second term breaks both flavor and spin symmetries. These two terms play a leading 
role in determining the mass spectra of heavy hadrons. Consider the expectation 
value of the Hamiltonian between mesonic states. The mass of the meson is written 
as 

$$m_M = m_Q + \bar\Lambda + \langle M|O_1|M\rangle + \langle M|O_2|M\rangle = m_Q + \bar\Lambda - \frac{\lambda_1}{2m_Q} + \frac{a_J\lambda_2}{2m_Q}, \tag{14.44}$$

with $a_J = 4\,\mathbf{S}_Q\cdot\mathbf{S}_\ell$. Here $O_1$ and $O_2$ are the first and second operators in Eq. 
(14.43), respectively, and $\bar\Lambda$ is the contribution from the light-quark degrees of 
freedom, which is a priori unknown. According to HQET, $\lambda_1$ and $\lambda_2$ are universal 
constants. One generally determines them from the moments of the energy 
spectrum of inclusive semileptonic B decays. It turns out that $\bar\Lambda \approx 0.40$ GeV and 
$\lambda_1 \approx -0.2$ GeV$^2$. 

$O_2$ is a magnetic-moment-type interaction, and its matrix element for mesons 
with $L = 0$ is proportional to $\mathbf{S}_Q\cdot\mathbf{S}_\ell = (\mathbf{J}^2 - \mathbf{S}_Q^2 - \mathbf{S}_\ell^2)/2$. The expectation values 
of this operator in triplet and singlet states are different, which causes a splitting 
among the lowest-lying states. The singlet ($J = 0$) is lowered and the triplet ($J = 1$) 
is raised, but by different amounts. These states are generally denoted by $M$ and 
$M^*$. Since the splitting scales as $1/m_Q$, it follows that 

$$\frac{m_{B^*} - m_B}{m_{D^*} - m_D} = \frac{m_c}{m_b},$$

which is satisfied to a high degree of accuracy. In fact, one can determine $\lambda_2$ from 
the splitting of the meson doublet. 
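
The spin algebra behind these statements can be made explicit. The sketch below evaluates $\mathbf{S}_Q\cdot\mathbf{S}_\ell = [J(J+1) - S_Q(S_Q+1) - S_\ell(S_\ell+1)]/2$ for the singlet and the triplet, and then forms the hyperfine shift under the assumption that it is normalized as $4\,\mathbf{S}_Q\cdot\mathbf{S}_\ell\,\lambda_2/(2m_Q)$. This normalization, the numerical values of $\lambda_2$, $m_c$, and $m_b$, and the function names are illustrative assumptions.

```python
from fractions import Fraction

HALF = Fraction(1, 2)


def spin_dot(J, S_Q=HALF, S_l=HALF):
    """Eigenvalue of S_Q . S_l in a state of total angular momentum J (L = 0)."""
    J = Fraction(J)
    return (J * (J + 1) - S_Q * (S_Q + 1) - S_l * (S_l + 1)) / 2


def hyperfine_shift(J, lambda2, m_Q):
    """Assumed normalization of the spin-dependent mass shift."""
    return 4 * spin_dot(J) * lambda2 / (2 * m_Q)


def doublet_splitting(lambda2, m_Q):
    """Mass difference between the vector (J = 1) and pseudoscalar (J = 0)."""
    return hyperfine_shift(1, lambda2, m_Q) - hyperfine_shift(0, lambda2, m_Q)


# Illustrative inputs: lambda2 in GeV^2, quark masses in GeV (assumptions).
lambda2 = Fraction(12, 100)
m_c, m_b = Fraction(14, 10), Fraction(48, 10)

ratio = doublet_splitting(lambda2, m_b) / doublet_splitting(lambda2, m_c)
print(spin_dot(0), spin_dot(1), ratio, m_c / m_b)
```

The singlet and triplet eigenvalues are $-3/4$ and $+1/4$, so the spin-weighted average of the doublet, with weights 1 and 3, is free of the hyperfine term, and the ratio of splittings reduces to $m_c/m_b$ independently of $\lambda_2$.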


Exclusive semileptonic decays 


An important prediction of HQET describes decays governed by the quark-level 
transition $b \to c$. The initial B meson decays into an electron-neutrino pair plus the 
D meson. In general the matrix element of a pseudoscalar-to-pseudoscalar transition 
involves two form factors that are functions of $q^2$ (see Eq. (14.22)). The velocities 
for the B and D mesons are $v^\mu = p^\mu/m_B$ and $v'^\mu = p'^\mu/m_D$ and the $q^2$ dependence 
may be replaced by 

$$w = v\cdot v' = \frac{m_B^2 + m_D^2 - q^2}{2m_Bm_D}. \tag{14.45}$$

In the limit that the masses are very big relative to the difference $m_B - m_D$ the two 
states B and D are, according to HQET, very similar. In this limit $v\cdot v' = 1$ and 
$v'^\mu \approx v^\mu$. 

To appreciate the situation, consider a configuration in which the B meson is 
at rest and the four-momentum carried away by the lepton pair is maximal, 
$q^2 = (m_B - m_D)^2$, which means that the D meson is also produced at rest. This is the 
zero-recoil point where $w = 1$. In the limit that the heavy-meson masses are large 
and close to each other, only one form factor contributes: 

$$\langle D(p')|V_\mu|B(p)\rangle = f_+(q^2)(p + p')_\mu + f_-(q^2)(p - p')_\mu$$

$$\Rightarrow\; \langle D(v')|\bar{c}\gamma_\mu b|B(v)\rangle = \sqrt{m_Bm_D}\,h(v\cdot v')\,(v + v')_\mu. \tag{14.46}$$

The square root arises from the normalization of the states and $h(v\cdot v')$ is a universal 
function called the Isgur-Wise function (Isgur and Wise, 1989, 1990). Furthermore, 
in the limit $m_B, m_D \to \infty$ with $m_B - m_D$ fixed, $q^2 \ll m_B^2, m_D^2$, the Isgur-Wise 
function measures the overlap of two identical wave functions so that $h(1) = 1$. 

Consider next the decay to a vector meson $B \to D^*\ell\nu$. We have discussed already 
the fact that to leading order the decay is independent of the spin of the heavy quarks. 
A consequence of the symmetry is the relation 

$$\langle D^*(v', \epsilon)|\bar{c}\gamma_\mu b|B(v)\rangle = i\sqrt{m_Bm_{D^*}}\,\epsilon_{\mu\nu\alpha\beta}\,\epsilon^{*\nu}v'^\alpha v^\beta\,h(v\cdot v'), \tag{14.47}$$

where the Lorentz structure depends on the polarization of the $D^*$ meson, $\epsilon^\nu$, and the 
same universal form factor $h(v\cdot v')$ appears as before. However, the universality 
is a feature of the zeroth-order Lagrangian and is broken by $1/m_Q$ corrections. 
For a complete analysis we must also include matrix elements of the axial current, 
which bring additional form factors into play. The general case must explain decays 
to several vector mesons $D^*(2010)$, $D^*(2420)$, ..., whose partial decay widths are 
different from each other. The differences remain after corrections for phase-space 
factors have been taken into account. 

This motivated an analysis of the partial decay rates in terms of $h(v\cdot v')$ plus its 
slope at the point $v\cdot v' = 1$. The important variable for the decays is $w$, with the 
decay spectrum given by (Manohar and Wise, 2000) 

$$\frac{d\Gamma}{dw}(B \to D^*\ell\nu) = \frac{G^2|V_{cb}|^2}{48\pi^3}\,K(w)\,h(v\cdot v')^2, \tag{14.48}$$

where $w$ is defined in (14.45), $K(w)$ is a known phase-space factor, and $h(v\cdot v')$ 
is the Isgur-Wise function including finite-mass and other (QCD, ...) corrections. 
The analysis determined the value of the form factor $F_1(w = 1)$ and its slope at 
$w = 1$. A systematic analysis of several experiments gives the mean value of the 
slope and the product as 

$$h(w = 1)\,|V_{cb}| = 0.0038 \pm 0.0010,$$

from which the value for $|V_{cb}|$ has been extracted (see Section 9.3.4). 

The applications of HQET to mass relations and B decays to $D$ and $D^*$ are 
successes of the theory. The Isgur-Wise function $h(v\cdot v')$ has been studied and 
calculated also in lattice gauge theories. Of direct interest is its calculation for the 
physical b and c masses in the zero-recoil limit (when $v\cdot v' = 1$). Lattice calculations 
give $h(1) = 0.929$ with a small error. 

B-meson decays to light mesons are much harder to calculate in HQET or on the 
lattice because most of the decay occurs in a kinematic region where the $\pi$ or the 
$\rho$ mesons have large momenta. They attract a good deal of attention because data 
on B-meson decays are accumulating in B-factories and are very important for the 
interpretation of CP asymmetries in B-meson decays. 


Inclusive semileptonic decays 


For inclusive decays one must calculate the decay rate in the rest frame of the b 
quark. To leading order the hadronic tensor will have the structure of Eq. (14.10), 
which is proportional to the δ-function 

$$\delta\left[(p_b - q)^2\right] = \delta(m_b^2 - 2p_b\cdot q + q^2).$$

In the limit $m_b^2 \gg q^2$ it reduces to 

$$\delta(m_b^2 - 2m_bq_0),$$

which peaks at the endpoint of the $q_0$ range. Since there is no averaging over a 
distribution function, the smoothness of the spectrum must be brought about by 
higher-order corrections of HQET. Higher-order corrections including bound-state 
effects must reproduce the effects and spectrum discussed in Sections 14.2.1 and 
14.2.2. 


14.6 The top quark and its physical properties 


The top quark is very heavy and weighs as much as a heavy atom. After the discovery 
of the bottom quark and the tau lepton, the top was predicted in order to preserve 
the symmetry between quarks and leptons. In addition, several properties of the 
low-lying states required the existence of a heavier state. Among these properties 
are the mixing and CP properties of the neutral K mesons, as well as those of 
neutral D and B mesons. Furthermore, in order to accommodate CP violation in 
the electroweak theory, a third generation of quarks is required. Some of these 
properties were mentioned earlier and they will be covered in greater detail in the 
next chapters. 

Although a heavy top quark was expected, its mass was, for a long time, 
unknown. Only with the discovery of the W and Z bosons and the precise measurements 
of their masses and widths did it become possible to put a limit on the top 
quark’s mass. A stringent limit is provided by the $\rho$ parameter, where the correction 
from the self-energy has a quadratic dependence on the quark mass. We quote the 
result here in order to stress how quantum corrections become important. The $\rho$ 
parameter is defined as 

$$\rho = \frac{M_W^2}{M_Z^2\cos^2\theta_W}$$

and has the value unity at tree level (see Eq. (12.19)). The loop correction from the top-bottom 
quark pair changes this ratio to 

$$\rho = 1 + \frac{3G_F}{8\sqrt{2}\pi^2}\left[m_t^2 + m_b^2 - \frac{2m_t^2m_b^2}{m_t^2 - m_b^2}\ln\left(\frac{m_t^2}{m_b^2}\right)\right] + \cdots. \tag{14.49}$$

For very accurate values of $M_W$, $M_Z$, and the mixing angle, the effect from the 
top quark is noticeable. Several analyses along these lines gave a range for the top 
quark’s mass in the neighborhood of 175 GeV, with an error of ±25 GeV, within 
which it was eventually discovered. 
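
The size of the effect is easily estimated. The sketch below assumes the standard one-loop form $\Delta\rho = \frac{3G_F}{8\sqrt{2}\pi^2}\left[m_t^2 + m_b^2 - \frac{2m_t^2m_b^2}{m_t^2 - m_b^2}\ln(m_t^2/m_b^2)\right]$ for the top-bottom doublet. The numerical value of $G_F$ and the quark masses inserted are assumptions made for illustration.

```python
import math

G_FERMI = 1.16637e-5  # Fermi constant in GeV^-2 (assumed numerical value)


def delta_rho(m_t, m_b, G=G_FERMI):
    """One-loop shift of the rho parameter from a heavy quark doublet."""
    t, b = m_t**2, m_b**2
    bracket = t + b - 2.0 * t * b / (t - b) * math.log(t / b)
    return 3.0 * G / (8.0 * math.sqrt(2.0) * math.pi**2) * bracket


shift = delta_rho(175.0, 4.75)  # quark masses in GeV (illustrative)
print(shift)

# Quadratic growth: doubling m_t roughly quadruples the shift
# when the bottom mass is negligible.
growth = delta_rho(350.0, 1.0e-3) / delta_rho(175.0, 1.0e-3)
print(growth)
```

For a top mass of 175 GeV the shift is just under one per cent, which is why precise measurements of $M_W$, $M_Z$, and the mixing angle were able to constrain the top mass before its discovery.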

The dominant decay of the top quark is into a bottom quark plus a W boson. 
The calculation of the decay is straightforward and we have formulated it as an 
exercise. If we neglect the mass of the bottom quark, the decay width is 

$$\Gamma(t \to bW^+) = \frac{G_FM_t^3}{8\sqrt{2}\pi}\,|V_{tb}|^2\left(1 - \frac{M_W^2}{M_t^2}\right)^2\left(1 + \frac{2M_W^2}{M_t^2}\right), \tag{14.50}$$

which grows rapidly with the top-quark mass. For $M_t = 175$ GeV, the decay width 
is 

$$\Gamma(t \to bW^+) = 1.55\ \text{GeV},$$

which corresponds to a top-quark lifetime of $0.4 \times 10^{-24}$ s. The confining effects of 
the strong interactions act on a time scale $\sim 1/\Lambda_{\rm QCD}$. This means that the top quark 
decays long before the interaction can act to produce hadrons. Unlike the 
other five quarks, the top quark forms no bound states, and there is no toponium spectroscopy. 
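
These numbers can be reproduced directly. The sketch below assumes the tree-level width $\Gamma = \frac{G_FM_t^3}{8\sqrt{2}\pi}|V_{tb}|^2(1 - r)^2(1 + 2r)$ with $r = M_W^2/M_t^2$ and a massless bottom quark, and converts it into a lifetime with $\tau = \hbar/\Gamma$. The values of $G_F$, $M_W$, $\hbar$, and $\Lambda_{\rm QCD}$ are assumed inputs, and $|V_{tb}| = 1$ is taken for simplicity.

```python
import math

G_FERMI = 1.16637e-5   # Fermi constant in GeV^-2 (assumed)
HBAR = 6.582e-25       # reduced Planck constant in GeV s (assumed)
M_W = 80.4             # W-boson mass in GeV (assumed)
LAMBDA_QCD = 0.2       # QCD scale in GeV (assumed, order of magnitude only)


def top_width(M_t, V_tb=1.0, m_W=M_W, G=G_FERMI):
    """Tree-level width of t -> b W+ with the bottom mass neglected."""
    r = (m_W / M_t) ** 2
    prefactor = G * M_t**3 / (8.0 * math.sqrt(2.0) * math.pi)
    return prefactor * V_tb**2 * (1.0 - r) ** 2 * (1.0 + 2.0 * r)


width = top_width(175.0)             # GeV
lifetime = HBAR / width              # seconds
hadronization = HBAR / LAMBDA_QCD    # time scale of confinement, seconds
print(width, lifetime, hadronization)
```

With these inputs the lifetime comes out almost an order of magnitude shorter than the confinement time scale, which is the quantitative content of the statement that the top quark decays before hadrons can form.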

The fact that the mass of the top quark is close to the value required by the
radiative corrections is a success of the electroweak theory. These and other tests of
the theory will be discussed in Section 17.2, which is devoted to precision tests of the theory.
Another test involves the value of $V_{tb}$, which, according to the analysis of the CKM
matrix, must be very close to unity. There is already experimental evidence that the
top decays primarily to a bottom quark, but the accuracy is still too poor to test
the unitarity prediction in Eq. (9.49).

The ultrarapid decay of the top quark implies that the production of $t\bar t$ pairs
in hadron collisions should be calculable in perturbative QCD. Electron–positron
colliders could be used to search for the reaction

$$e^+e^- \to t\bar t,$$

but at the lower energies available in those colliders it is impossible to produce the
pairs. The decisive experiments have been carried out at the proton–antiproton
collider of Fermilab. Some characteristic reactions and their decays are listed
below:

$$\begin{aligned}
p + \bar p \to t + \bar t + \text{anything} &\to e\nu_e + \mu\nu_\mu + \text{hadrons},\\
&\to e^\pm\mu^\mp\, b\bar b\, \not\!E_T,\\
&\to \mu^\pm + \text{jets} + \not\!E_T.
\end{aligned}$$

The dilepton events ($e\mu$, $ee$, and $\mu\mu$) are produced when both W bosons decay into
$e\nu$ or $\mu\nu$. The neutrinos remain unobservable and are represented by the missing
energy $\not\!E_T$. Events in the lepton-plus-jets channels occur when one W decay produces
$e$ or $\mu$ and the other W decays into a quark–antiquark pair.

A second challenge to experimenters is the complexity of the events in high-energy
proton–antiproton collisions. The $t\bar t$ pair produced is accompanied by scores
of other particles. Separating the top quark is like searching for a needle in a
haystack. Two experimental groups at the Tevatron collider at Fermilab succeeded
in discovering the production of $t\bar t$ pairs. As mentioned in the introduction to this
chapter, the production of $t\bar t$ pairs and their subsequent decays are computed in
the parton model, using the best possible quark distribution functions, and the top
quark’s mass is determined as the value which optimizes the fit.

We close this section with a speculation. The fact that the mass of the top quark 
is larger than the mass of gauge bosons and closer to the scale of the symmetry- 
breaking motivates the thought that the top quark is intimately connected to the 
symmetry-breaking. One suggestion is to study the decays of the top quark to the 
b quark and the W boson, which is expected to be longitudinally polarized. Since 
the longitudinal state of W bosons is developed through the Higgs mechanism, it 
may be sensitive to new physics. In another suggestion, the top—antitop pairs attract 
each other through a new force to make a condensate, which is the Higgs particle. 
We return to this possibility in Chapter 17. 


14.7 Loop diagrams with heavy quarks 
14.7.1 Mixing of states and lifetime differences: a preview 


Heavy quarks also appear as intermediate states in loop diagrams and give dominant
contributions. As a first example we study box diagrams, in which second-order
weak interactions change the flavor quantum number by two units and produce
the mixing of $B^0$–$\bar B^0$, as well as $K^0$–$\bar K^0$, states. When the intermediate states are
heavy quarks, they produce a short-distance interaction shown with the diagrams
in Fig. 14.4. The strategy of the calculations is to construct an effective $\Delta F = 2$
Lagrangian from the free-quark model and then take the matrix element between
$B^0$ and $\bar B^0$ (or, analogously, $K^0$ and $\bar K^0$). We compute the diagram of Fig. 14.4 in
Section 14.7.2, where it is also shown that box diagrams are finite, i.e. they do not
bring in any ultraviolet divergences. Before we present the calculation, we quote
several results from Chapters 15 and 16 in order to emphasize the point that box
diagrams are indeed important. The interested reader may consult Chapters 15 and
16 for the underlying physics and then return to this section for the study of a loop
diagram.

Figure 14.4. Box diagrams for $\Delta B = 2$ transitions. We calculate the “scattering”
term in (a). The “annihilation” term (b) is equal to the “scattering” term.

Let us denote by $\Delta M_d = M_{B_H} - M_{B_L}$ the difference in mass between the physical
states built up from the mesons $B_d^0$ and $\bar B_d^0$. We use the subscripts H and L to denote
heavy and light mesonic states. The structure of physical states is described in
Chapter 15. The calculation of the box diagram with the top quark in the intermediate
state gives

$$\Delta M_d = -\frac{G^2}{16\pi^2}\,|V_{td}^* V_{tb}|^2\, M_W^2\, E(x_t)\, X_{B_d}, \quad\text{with}\quad x_t = \frac{m_t^2}{M_W^2}, \tag{14.51}$$

$$E(x_t) = \frac{3}{2}\left(\frac{x_t}{1 - x_t}\right)^3 \ln x_t - x_t\left[\frac14 + \frac94\,\frac{1}{1 - x_t} - \frac32\,\frac{1}{(1 - x_t)^2}\right], \tag{14.52}$$

$$X_{B_d} = \langle B_d|\,\bar d\gamma_\mu(1 - \gamma_5)b\;\bar d\gamma^\mu(1 - \gamma_5)b\,|\bar B_d\rangle. \tag{14.53}$$

The function $E(x)$ comes from the integration over the loop and $X_{B_d}$ is the matrix
element of a four-quark operator between $B_d$ and $\bar B_d$ states. Four-quark operators
appear as overall factors in calculations of box diagrams. The same calculation also
determines the width difference

$$\Delta\Gamma_d = -\frac{3G^2}{32\pi}\, m_b^2\, |V_{td}^* V_{tb}|^2\, X_{B_d}. \tag{14.54}$$

In the decay only u and c quarks appear as final states. The decay is discussed at
the end of Section 16.4 and shown in Fig. 16.1. The dependences of the width and
$\Delta M_d$ on the quark masses are different. Mass and width differences are related by

$$\Delta\Gamma_d = \frac32\,\pi\,\frac{m_b^2}{M_W^2}\,\frac{1}{E(x_t)}\,\Delta M_d, \tag{14.55}$$

from which we conclude that for large $m_t$ the width difference is much smaller than
the mass difference. Furthermore, on comparing $\Delta M_d$ with the decay width

$$\Gamma_B = \frac{G^2 m_b^5}{192\pi^3}\,|V_{bc}|^2, \tag{14.56}$$

we note that there is an enhancement of the mass difference arising from the high
mass of the heavy quark. Thus, for favorable values of the other quantities, $\Delta M_d$
can be comparable to $\Gamma_B$. In fact, experiments determined

$$\Delta M_d = (0.49 \pm 0.01) \times 10^{12}\,\hbar\,{\rm s}^{-1}, \tag{14.57}$$

which is one fifth of the width. A similar analysis of the mass difference of $B_s = (b\bar s)$
leads to a larger mass difference because it is enhanced by the CKM element.
Experimentally there is the lower bound

$$\Delta M_s > 13.1 \times 10^{12}\,\hbar\,{\rm s}^{-1}. \tag{14.58}$$


The discussion shows that loop contributions are important since they determine 
physical quantities and correlate properties of various mesons. 

The physics described for the neutral $B_d$ mesons contains several general properties.
The down quarks are always lighter than the upper quark of the same family.
Consequently, mesons containing down quarks decay to quarks of a lighter family,
for which the decay width is suppressed by a CKM element. On the other hand, the
mass difference of neutral mesons containing the down quarks involves the square
of the mass of the upper quark. If the mass of the upper quark in the same family
is much larger and the values of the mixing angles and the reduced matrix element
are favorable, then substantial mixing between the neutral mesons is possible. This
situation is realized for the $B_d^0$, $B_s^0$, and $K^0$–$\bar K^0$ mesons.

The opposite situation prevails for neutral mesons that contain a heavy upper
quark. For example, for $D^0 = (c\bar u)$ mesons the decay width is proportional to $m_c^5$
and the CKM element is practically unity. The mass difference is proportional to
$m_s^2$ or $m_b^2$ and the CKM elements are smaller than unity, so there is no substantial
enhancement. Consequently, mixing of the $D^0$ and $\bar D^0$ states is small and it has not
been observed yet.


14.7.2 Calculation of a box diagram 


We first compute the box diagrams of Fig. 14.4 with intermediate quarks
$i, j = u, c, t$. In addition to the diagrams with the exchange of W bosons there
are exchanges with charged Higgses, which become important when the masses of
the internal quarks are large. The charged Higgses become the longitudinal degrees of
freedom of the W bosons. For simplicity we shall assume that the external momenta
and masses are zero. In the Feynman–’t Hooft gauge, we obtain
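$$\begin{aligned}
T ={}& \left(\frac{g}{2\sqrt2}\right)^4 \sum_{i,j}\int\frac{d^4k}{(2\pi)^4}\left(\frac{-i}{k^2 - M_W^2}\right)^2 V_{id}^* V_{ib}\;\bar d\gamma^\mu(1 - \gamma_5)\frac{\not k + m_i}{k^2 - m_i^2}\gamma^\nu(1 - \gamma_5)b\\
&\times V_{jd}^* V_{jb}\;\bar d\gamma_\nu(1 - \gamma_5)\frac{\not k + m_j}{k^2 - m_j^2}\gamma_\mu(1 - \gamma_5)b\\
={}& -\left(\frac{g}{2\sqrt2}\right)^4 \sum_{i,j}\lambda_i\lambda_j\int\frac{d^4k}{(2\pi)^4}\frac{1}{(k^2 - M_W^2)^2}\;\bar d\gamma^\mu(1 - \gamma_5)\frac{\not k + m_i}{k^2 - m_i^2}\gamma^\nu(1 - \gamma_5)b\\
&\times \bar d\gamma_\nu(1 - \gamma_5)\frac{\not k + m_j}{k^2 - m_j^2}\gamma_\mu(1 - \gamma_5)b,
\end{aligned} \tag{14.59}$$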


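with $\lambda_i = V_{id}^* V_{ib}$. Simple counting of the momenta shows that the integral is convergent.
The quark masses in the numerators do not contribute, because of the
$(1 - \gamma_5)$ structure. The surviving integral has the following Lorentz structure:

$$I_{\alpha\beta}(i,j) = \int d^4k\,\frac{k_\alpha k_\beta}{(k^2 - M_W^2)^2 (k^2 - m_i^2)(k^2 - m_j^2)} = I(m_i, m_j, M_W)\, g_{\alpha\beta}. \tag{14.60}$$

The fact that the integral is proportional to $g_{\alpha\beta}$ simplifies the spinor structure,
which takes the form

$$\bar d\gamma^\mu\gamma^\alpha\gamma^\nu(1 - \gamma_5)b \cdot \bar d\gamma_\nu\gamma_\alpha\gamma_\mu(1 - \gamma_5)b = 4\,\bar d\gamma^\alpha(1 - \gamma_5)b \cdot \bar d\gamma_\alpha(1 - \gamma_5)b. \tag{14.61}$$

This and other spinor identities follow from the relation

$$\gamma^\mu\gamma^\alpha\gamma^\nu = g^{\mu\alpha}\gamma^\nu + g^{\alpha\nu}\gamma^\mu - g^{\mu\nu}\gamma^\alpha - i\varepsilon^{\mu\alpha\nu\beta}\gamma_5\gamma_\beta, \tag{14.62}$$

as demonstrated in Appendix C.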


14.7.3 Integrals in Euclidean space 


In the calculation of the integrals we consider the momenta to be Euclidean, by
setting

$$k_0 = i k_4. \tag{14.63}$$

Then the propagators do not have poles. The four-dimensional volume element
becomes

$$d^4k = dk_0\, d^3k = i\, dk_4\, d^3k. \tag{14.64}$$

A typical integral in loop calculations takes the form

$$I^{(n)}(m) = \int d^4k\,\frac{1}{(k^2 - m^2)^n} = i(-1)^n \int d^4k_E\,\frac{1}{(k_E^2 + m^2)^n}. \tag{14.65}$$

In Euclidean space we can transform to spherical coordinates,

$$d^4k_E \to k^{4-1}\, dk\, d\Omega_4 \quad\text{with}\quad \int d\Omega_4 = \frac{2\pi^2}{\Gamma(2)}. \tag{14.66}$$

For our specific integral

$$I(m_i, m_j, M_W) = \frac14\int d^4k\,\frac{k^2}{(k^2 - M_W^2)^2 (k^2 - m_i^2)(k^2 - m_j^2)} = -\frac{i\pi^2}{2}\int_0^\infty dk\,\frac{k^5}{(k^2 + M_W^2)^2 (k^2 + m_i^2)(k^2 + m_j^2)}. \tag{14.67}$$


14.7.4 Feynman parameters 


The standard way of carrying out momentum integrations in loop integrals is to
use the so-called Feynman-parameter technique in order to transform a product of
propagators depending on the integration momentum into a single factor. This is
accomplished with the following identity:

$$\frac{1}{(a_1 + i\epsilon)(a_2 + i\epsilon)\cdots(a_n + i\epsilon)} = (n - 1)!\int_0^1 dx_1\, dx_2 \cdots dx_n\,\frac{\delta(1 - x_1 - \cdots - x_n)}{[a_1x_1 + a_2x_2 + \cdots + a_nx_n + i\epsilon]^n}. \tag{14.68}$$

Simple examples can be worked out. When the propagator $(a_i + i\epsilon)^k$, with $k$ an
integer, appears on the left-hand side, the corresponding formula is obtained by
differentiating the above expression with respect to $a_i$ several times. Following this
rule we obtain

$$\frac{1}{(a_1 + i\epsilon)^2(a_2 + i\epsilon)(a_3 + i\epsilon)} = 3!\int_0^1 dx\int_0^{1-x} dy\,\frac{1 - x - y}{[a_1(1 - x - y) + a_2x + a_3y + i\epsilon]^4}. \tag{14.69}$$
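The differentiated identity, in which the propagator $a_1$ appears squared and the parameter integral carries the weight $3!\,(1 - x - y)$ and the fourth power of the combined denominator, can be checked numerically for positive real $a_i$ (so that the $i\epsilon$ plays no role). The sketch below is only an illustration; the function name `feynman_rhs`, the midpoint rule, and the test values $a_1, a_2, a_3 = 1, 2, 3$ are our choices.

```python
def feynman_rhs(a1, a2, a3, n=200):
    """Midpoint-rule estimate of
    3! * int_0^1 dx int_0^{1-x} dy (1-x-y) / [a1(1-x-y) + a2 x + a3 y]^4."""
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        x = (i + 0.5) * h
        for j in range(n):
            v = (j + 0.5) * h        # y = (1 - x) v maps the triangle onto the unit square
            y = (1.0 - x) * v
            w = 1.0 - x - y          # weight carried by the squared propagator a1
            denom = a1 * w + a2 * x + a3 * y
            total += w / denom ** 4 * (1.0 - x)   # (1 - x) is the Jacobian dy/dv
    return 6.0 * total * h * h

a1, a2, a3 = 1.0, 2.0, 3.0
lhs = 1.0 / (a1 ** 2 * a2 * a3)
rhs = feynman_rhs(a1, a2, a3)
print(lhs, rhs)
```

For equal arguments $a_1 = a_2 = a_3 = a$ the parameter integral can be done by hand and gives $1/a^4$, which is a quick analytic cross-check of the normalization.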
Returning to our integral in Eq. (14.67),

$$I(m_i, m_j, M_W) = -\frac{i\pi^2}{4}\,3!\int_{k^2=0}^{\infty} dk^2\,(k^2)^2\int_0^1 dx\int_0^{1-x} dy\,\frac{1 - x - y}{[k^2 + M_W^2(1 - x - y) + m_i^2x + m_j^2y]^4}. \tag{14.70}$$

Now, since the Feynman parameters in the $\delta$-function sum up to unity, the coefficient
of the $k^2$ term in the denominator is always unity. We can perform the $dk^2$ integration
to obtain

$$\begin{aligned}
I(m_i, m_j, M_W) &= -\frac{i\pi^2}{2M_W^2}\int_0^1 dx\int_0^{1-x} dy\,\frac{1 - x - y}{1 - x - y + x_ix + x_jy}\\
&= -\frac{i\pi^2}{4M_W^2}\,\frac{1}{x_i - x_j}\left[\frac{1}{1 - x_i} - \frac{1}{1 - x_j} + \frac{x_i^2}{(1 - x_i)^2}\ln x_i - \frac{x_j^2}{(1 - x_j)^2}\ln x_j\right],
\end{aligned} \tag{14.71}$$

with $x_i = m_i^2/M_W^2$ and $x_j = m_j^2/M_W^2$. This diagram gives the dominant contribution
for cases in which $x_i$ and $x_j \ll 1$. In the other cases with the exchange of a heavy
scalar, additional diagrams must be included; the complete answer is given in
Chapter 15.
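The radial integral behind this result can be cross-checked numerically. Scaling the Euclidean momentum as $k^2 = M_W^2 t$, the quantity to be evaluated is the dimensionless integral $J(x_i, x_j) = \int_0^\infty dt\, t^2/[(1 + t)^2(t + x_i)(t + x_j)]$, and partial fractions give $J = 1/[(1 - x_i)(1 - x_j)] + [g(x_i) - g(x_j)]/(x_i - x_j)$ with $g(x) = x^2\ln x/(1 - x)^2$, so that $I = -(i\pi^2/4M_W^2)\,J$. The sketch below compares the two; the function names, the substitution $t = s/(1 - s)$, and the sample values of $x_i$, $x_j$ are assumptions made for illustration.

```python
import math

def closed_form(xi, xj):
    """Partial-fraction result for J(xi, xj); requires xi != xj and xi, xj != 1."""
    gi = xi ** 2 * math.log(xi) / (1.0 - xi) ** 2
    gj = xj ** 2 * math.log(xj) / (1.0 - xj) ** 2
    return 1.0 / ((1.0 - xi) * (1.0 - xj)) + (gi - gj) / (xi - xj)

def numeric(xi, xj, n=2000):
    """Simpson estimate of J(xi, xj) after mapping t = s / (1 - s) onto s in [0, 1]."""
    def f(s):
        return s * s / ((s + xi * (1.0 - s)) * (s + xj * (1.0 - s)))
    h = 1.0 / n
    total = f(0.0) + f(1.0)
    for k in range(1, n):
        total += (4.0 if k % 2 else 2.0) * f(k * h)
    return total * h / 3.0

xi, xj = 0.3, 2.5   # hypothetical mass ratios m^2 / M_W^2
print(closed_form(xi, xj), numeric(xi, xj))
```

For $x_i, x_j \to 0$ the integral tends to unity, so that $I \to -i\pi^2/(4M_W^2)$ for light internal quarks.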

This first example shows that the calculation of loop diagrams involves four 
steps. 


1. A simplification of the spin structure, which reduces the calculation to a few four- 
dimensional integrals. 

2. The reduction of the denominators to a single factor with the help of Feynman parameters. 

3. The completion of the four-dimensional integrals. This is carried through in Euclidean 
space and holds in the Minkowskian region by analytic continuation. The case of the 
box diagram is relatively simple, because the integral is convergent. In the general case, 
the integrals are divergent, which demanded the development of special methods for 
subtracting the infinities, which respect the gauge invariance of the theory. 

4. Finally, there is an integration over the Feynman parameters, which requires special 
attention at the endpoints, where infrared singularities may occur. 


Problems for Chapter 14 


1. Draw the graph of dr?/dx, where x = 2E/W, in the quark rest frame. Multiply by the 
QCD correction factor 


G(x) = Iama — x)[2 + Ind — x)] 


and show how the distribution changes. 

2. Use the identities given in the text to prove Eq. (14.33). Then show that it is possible to
replace $D_\mu$ by $D_{\perp\mu} = D_\mu - (D\cdot v)\,v_\mu$.

3. For the semileptonic decay $B \to D e\nu$ the dominant form factor $f_+(q^2)$ is related to the
Isgur–Wise function as in Eq. (14.46).
(i) Show that $f_+(q^2)$ is related to $h(v\cdot v')$ as follows:


Fad = hv vB 
mg + mp 
(ii) Calculate the decay spectrum $d\Gamma/dW$ in the heavy-quark limit.
(Hint: you may begin with $d\Gamma/dq^2$ and then take the heavy-quark limit.)
4. Use the Feynman rules from Chapter 8 to calculate the decay width for the top quark 
given in Eq. (14.50). 
5. Prove the following useful identities that involve Feynman parameters:

(i) $$\frac{1}{(a_1 + i\epsilon)(a_2 + i\epsilon)} = \int_0^1 \frac{dx}{[a_1x + a_2(1 - x) + i\epsilon]^2},$$

(ii) $$\frac{1}{(a + i\epsilon)^3(b + i\epsilon)} = \int_0^1 \frac{3x^2\, dx}{[ax + b(1 - x) + i\epsilon]^4}.$$

CP violation: K mesons


15.1 Introduction 


Most of the symmetries in elementary-particle physics are continuous. A typical
example is the symmetry of rotations around an axis, where the angle of rotation can
assume any value between zero and $2\pi$. In addition to continuous symmetries, there
are also discrete symmetries, for which the possible states assume discrete values
classified with the help of a few integers. For instance, snowflakes exhibit the discrete
symmetry of rotations by 60° and crystals exhibit various types of discrete
symmetries. In elementary-particle physics there are three discrete symmetries of
basic importance: parity, charge conjugation, and time-reversal.

Parity is the reflection of space coordinates and will be denoted by P. Under
parity there are two states: the object and its space reflection. Parity is familiar from
quantum mechanics, where the eigenstates of Hamiltonians are classified according
to their properties under space reflection. For spherically symmetric potentials the
wave functions are proportional to the spherical harmonics $Y_\ell^m(\theta, \phi)$, whose parity
is $(-1)^\ell$. For a long time it was assumed that the fundamental interactions respect P,
but a critical review of experimental evidence led two theoreticians, T. D. Lee and
C. N. Yang, to suggest that parity may be violated by the weak interactions. One
year later, an experiment led by C. S. Wu brought the proof that the P symmetry is
indeed violated by weak interactions.

The symmetry of charge conjugation, to be denoted by C, exchanges particles
with antiparticles. One can imagine building an antiworld by replacing all particles
by antiparticles. In the antiworld the three interactions gravity, the strong force, and
electromagnetism are the same, but the weak interactions are different. For example,
in the antiworld muon-type antineutrinos are right-handed and produce $\mu^+$, which
are also right-handed. In comparison, neutrinos are left-handed and always produce,
in high-energy reactions, left-handed $\mu^-$. In the weak interactions the C symmetry
is broken. However, it was assumed, at that time, that the observed processes do
respect the combined CP transformation, the one obtained by applying both C and
P transformations.

There is a fundamental reason why CP symmetry plays a crucial role. It is intimately
linked to the time-reversal transformation (T). This transformation consists
of “looking” at an experiment running backward in time. Although, at the macroscopic
level, one can distinguish the real sequence of events from the time-reversed
one in terms of large-scale phenomena such as entropy or the expansion of the
Universe, this is not a priori evident for microscopic interactions, i.e. it is not a
priori evident that the amplitudes for reactions and for the time-reversed reactions
are equal.

The analysis of CP violation is facilitated by an important theorem known as the CPT
theorem. It states that any local field theory based on special relativity and quantum
mechanics is invariant under the combined action of C, P, and T. A consequence
of the theorem is that CP symmetry implies T symmetry, because any CP violation
should be compensated by T violation.

Until 1964 the decays and interactions of particles showed that the CP symmetry
was conserved; this created the belief that microscopic phenomena also obey the
T symmetry. In 1964 CP violation was observed in an experiment dedicated to the
study of $K^0$ and $\bar K^0$ mesons. Since then it has become an active topic of research,
with CP violation having been observed so far in the K and the B mesons. In this
chapter we study the properties of mesons under discrete symmetries, leaving the
study of fermions for more specialized articles and books.


15.2 General properties 


We now describe the properties that govern the decays of neutral pseudoscalar
mesons, such as $K^0$, $D^0$, and $B^0$, when the interactions obey the CPT and CP symmetries.
The results guide us to properties of these reactions that indicate breakdown
of CP and/or CPT symmetries.

For simplicity of presentation we shall consider the $K^0$ as an example and describe
properties of the $K^0$–$\bar K^0$ system; however, the results are general and hold for the
other mesons too. We adopt the phase convention

$$P|K^0\rangle = |K^0\rangle, \qquad P|\bar K^0\rangle = |\bar K^0\rangle, \tag{15.1}$$

$$C|K^0\rangle = |\bar K^0\rangle, \tag{15.2}$$

$$T|K^0\rangle = \langle K^0|, \tag{15.3}$$

with similar relations being valid for the antiparticle $|\bar K^0\rangle$. Even though $|K^0\rangle$ is
a pseudoscalar particle, we chose a convention in which $|K^0\rangle$ transforms
into itself under parity, since with this choice it is easier to keep track of the minus signs. The
freedom to make this choice comes from the fact that the parity transformation requires
$P^2 = 1$; thus there is still freedom of the overall sign.

The decays of the mesons are mediated by the weak interactions, whose operators
are Hermitian. Non-Hermitian terms appear in loop diagrams from energy
denominators; however, Hermiticity still holds when we consider the dispersive
and absorptive parts separately, as we discuss next. For instance, the semileptonic
decay has the amplitude

$$a_\ell = \langle \pi^-\ell^+\nu|H_W|K^0\rangle \tag{15.4}$$

with $H_W$ the weak Lagrangian, which is Hermitian. Similarly, the contributions to
the mass matrix, with $i$ and $j$ being $K^0$ or $\bar K^0$, have the general form

$$H_{ij} = m_K\delta_{ij} + \langle i|H_W|j\rangle + \sum_n \frac{\langle i|H_W|n\rangle\langle n|H_W|j\rangle}{M_K - E_n + i\epsilon}. \tag{15.5}$$

The first two terms appear for $i = j$. The last term originates from box diagrams and
is present for $\Delta S = 2$ transitions with $j = K^0$ and $i = \bar K^0$. The last term is decomposed
into two Hermitian matrices by decomposing the energy denominator into a
principal part denoted by P and a $\delta$-function term. The Hamiltonian decomposes
as follows:

$$H_{ij} = M_{ij} - \frac{i}{2}\Gamma_{ij} \tag{15.6}$$

with a dispersive term

$$M_{ij} = m_K\delta_{ij} + \langle i|H_W|j\rangle + P\sum_n \frac{\langle i|H_W|n\rangle\langle n|H_W|j\rangle}{M_K - E_n} \tag{15.7}$$

and an absorptive term

$$\Gamma_{ij} = 2\pi\sum_n \langle i|H_W|n\rangle\langle n|H_W|j\rangle\,\delta(E_n - M_K). \tag{15.8}$$

These terms satisfy the Hermiticity relations

$$M_{ij} = M_{ji}^* \quad\text{and}\quad \Gamma_{ij} = \Gamma_{ji}^*. \tag{15.9}$$

More relations follow from CP and CPT invariance. We present the conditions 
as two theorems. 


Theorem 1 For a Hamilton operator that is CPT-invariant the amplitudes for the
decays of particles and antiparticles are the complex conjugates of each other.


Proof Let us denote the amplitude for $K^0$ decay by

$$A_f = \langle X_f|H_W|K^0\rangle. \tag{15.10}$$

Then, using $(CPT)H_W(CPT)^{-1} = H_W$, we obtain

$$A_f = \langle X_f|(CPT)^{-1}H_W(CPT)|K^0\rangle = \langle \bar K^0|H_W|\bar X_f\rangle = \bar A_f^*, \tag{15.11}$$

where $|\bar X_f\rangle = CP|X_f\rangle$, i.e. the CP conjugate state, and

$$\bar A_f = \langle \bar X_f|H_W|\bar K^0\rangle. \tag{15.12}$$

By applying the theorem to the diagonal elements of the mass matrix $H_{ij}$ and
using Hermiticity of the dispersive and absorptive parts, we obtain

$$M_{11} = M_{22} \quad\text{and}\quad \Gamma_{11} = \Gamma_{22}. \tag{15.13}$$

This is the statement that CPT invariance demands the equality of masses and
widths for particles and antiparticles. When we apply the theorem to off-diagonal
elements, we obtain the relation

$$M_{12} = M_{21}^* \quad\text{and}\quad \Gamma_{12} = \Gamma_{21}^*, \tag{15.14}$$

which is not new; it merely reproduces the Hermiticity relations in Eq. (15.9).
It follows now that the mass matrix in the $|K^0\rangle$, $|\bar K^0\rangle$ basis has the form

$$M - \frac{i}{2}\Gamma = \begin{pmatrix} M_{11} - \frac{i}{2}\Gamma_{11} & M_{12} - \frac{i}{2}\Gamma_{12}\\[2pt] M_{12}^* - \frac{i}{2}\Gamma_{12}^* & M_{11} - \frac{i}{2}\Gamma_{11}\end{pmatrix}. \tag{15.15}$$

As mentioned already, the form of the diagonal elements follows from conservation
of the CPT symmetry. We can make them different, thus introducing by hand a
violation of CPT invariance, and study the modifications in the lifetimes and other
properties of the states.

The presence of the off-diagonal matrix elements implies the mixing of the states
$K^0$ and $\bar K^0$. The physical states are a mixture of them, obtained by diagonalizing the
mass matrix, which will be presented in the next section. Additional restrictions,
which we describe in the next theorem, are introduced by CP symmetry.


Theorem 2 For a Hamiltonian that is CP-invariant, the decay amplitudes for
particles and antiparticles are relatively real.


Proof


(i) As before we denote by $A_f$ and $\bar A_f$ the decay amplitudes for particles and antiparticles,
respectively.
(ii) CP invariance implies

$$H_W = (CP)^\dagger H_W (CP). \tag{15.16}$$

(iii) Then the matrix elements are related,

$$A_f = \langle X_f|(CP)^\dagger H_W(CP)|K^0\rangle = \langle \bar X_f|H_W|\bar K^0\rangle = \bar A_f. \tag{15.17}$$

We can apply the theorem in cases in which $|X_f\rangle$ is a specific final state or in the
case $|X_f\rangle = |\bar K^0\rangle$, which refers to the mass matrix.

Let us consider the latter case first. Taking $|X_f\rangle = |\bar K^0\rangle$ and $|\bar X_f\rangle = |K^0\rangle$, we
obtain the relation

$$\langle \bar K^0|H_W|K^0\rangle = \langle K^0|H_W|\bar K^0\rangle. \tag{15.18}$$

In this and the following equations $H_W$ can be the lowest-order Lagrangian or
may include higher-order terms with possible contractions between fields. It follows
now that, when we consider the dispersive and absorptive terms separately, they
are relatively real. This is a stronger restriction than the Hermiticity requirement of
Eq. (15.15), where they were complex conjugates of each other. We shall use these
properties in the next section, where we will define the parameter $\varepsilon$.

For decays of particles the theorem says that the amplitudes for particles and
antiparticles are relatively real. This indicates a strategy for detecting CP violation.
It consists of measuring the phase difference of the two amplitudes relative to a third
standard phase, such as the phase occurring in the time development of states, the
phase in a Breit–Wigner propagator, or some other known phase. We shall describe
several methods in the next sections.


15.3 Time development of states 


In the following sections of Chapter 15, we shall assume that CPT is a good symmetry
of Nature and study cases in which the CP symmetry is broken. The fact that
there are off-diagonal elements in Eq. (15.5) means that $|K^0\rangle$ and $|\bar K^0\rangle$ are not mass
eigenstates but the physical states are mixed states. The physical states are obtained
by diagonalizing the matrix in Eq. (15.15). Beyond the solution of the physical
states we are interested in learning how the elements $M_{12}$ and $\Gamma_{12}$ are produced. In
gauge theories they originate from box diagrams and lead, for the various mesons,
to terms of different magnitudes, so that the physical properties of $K^0$, $D^0$, and $B^0$
mesons are very different. We describe first the time development of the states.

We are interested in defining a state that is a superposition of $|K^0\rangle$ and $|\bar K^0\rangle$ and
has the time development

$$\psi_1(t) = \left(B_1|K^0\rangle + D_1|\bar K^0\rangle\right)e^{-iE_1t}, \tag{15.19}$$

with $B_1$ and $D_1$ being constants. The time evolution is described by the Schrödinger
equation

$$i\frac{\partial}{\partial t}\psi(t) = \left(M - \frac{i}{2}\Gamma\right)\psi(t), \tag{15.20}$$

whose stationary solutions are determined by the eigenvalue problem

$$\begin{pmatrix} H & H_{12}\\ H_{21} & H\end{pmatrix}\begin{pmatrix} B_1\\ D_1\end{pmatrix} = E_1\begin{pmatrix} B_1\\ D_1\end{pmatrix}, \tag{15.21}$$

with a similar equation for the second eigenvalue $E_2$.
The solutions have energies $E_{1,2} = H \pm \sqrt{H_{12}H_{21}}$ and the eigenfunctions

$$\psi_{1,2} = \begin{pmatrix} 1\\ \pm\left(H_{21}/H_{12}\right)^{1/2}\end{pmatrix}e^{-iE_{1,2}t} = \begin{pmatrix} 1\\ \pm\, q/p\end{pmatrix}e^{-iE_{1,2}t}, \tag{15.22}$$

respectively. We have written the wave functions in terms of the parameter $q/p$
and have not yet normalized them. The reason for this is that one frequently uses
another parameter, $\varepsilon$, which is defined by

$$\frac{q}{p} = \frac{1 - \varepsilon}{1 + \varepsilon} = \left[\frac{M_{12}^* - (i/2)\Gamma_{12}^*}{M_{12} - (i/2)\Gamma_{12}}\right]^{1/2} \equiv e^{-i\xi}, \tag{15.23}$$

which will be used later on. The dynamics of each problem resides in the matrix
elements of the Hamiltonian, which are calculated in terms of the box diagrams.
We describe the results of these calculations in the next section. They provide us
with values for $M_{12}$ and $\Gamma_{12}$, which turn out to be complex functions indicating
CP violation in the mass matrix.
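The diagonalization is simple enough to be carried out numerically. The sketch below computes $q/p = (H_{21}/H_{12})^{1/2}$, the parameter $\varepsilon = (1 - q/p)/(1 + q/p)$, and the splitting $2\sqrt{H_{12}H_{21}}$ of the two eigenvalues from given $M_{12}$ and $\Gamma_{12}$. The function name `mixing_parameters` and all numerical inputs are hypothetical: the real parts are taken of the order of the kaon mass and width differences, and the small imaginary part of $M_{12}$ is put in by hand to mimic CP violation.

```python
import cmath

def mixing_parameters(m12, gamma12):
    """Return (q/p, epsilon, eigenvalue splitting) for complex M12 and Gamma12."""
    m12, gamma12 = complex(m12), complex(gamma12)
    h12 = m12 - 0.5j * gamma12
    h21 = m12.conjugate() - 0.5j * gamma12.conjugate()
    q_over_p = cmath.sqrt(h21 / h12)
    eps = (1.0 - q_over_p) / (1.0 + q_over_p)
    splitting = 2.0 * cmath.sqrt(h12 * h21)
    return q_over_p, eps, splitting

# Hypothetical inputs in GeV: Re M12 and Re Gamma12 of kaon size, Im M12 chosen by hand.
M12 = complex(1.76e-15, 1.156e-17)
GAMMA12 = complex(-3.68e-15, 0.0)

q_over_p, eps, splitting = mixing_parameters(M12, GAMMA12)
overlap = 2.0 * eps.real / (1.0 + abs(eps) ** 2)   # non-orthogonality of the two eigenstates

print("q/p       =", q_over_p)
print("|epsilon| =", abs(eps), " phase (rad) =", cmath.phase(eps))
print("splitting =", splitting)
print("overlap   =", overlap)
```

When $M_{12}$ and $\Gamma_{12}$ are real, $q/p = 1$ and $\varepsilon = 0$. For small imaginary parts the exact $\varepsilon$ agrees with the first-order expression $(i\,{\rm Im}\,M_{12} + \frac12{\rm Im}\,\Gamma_{12})/(2\sqrt{H_{12}H_{21}})$.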

Let us discuss the physical states. On substituting $q/p$ into Eq. (15.19), we obtain
at $t = 0$ two normalized states,

$$|K_S\rangle = \frac{1}{\sqrt{2(1 + |\varepsilon|^2)}}\left[(1 + \varepsilon)|K^0\rangle + (1 - \varepsilon)|\bar K^0\rangle\right], \tag{15.24}$$

$$|K_L\rangle = \frac{1}{\sqrt{2(1 + |\varepsilon|^2)}}\left[(1 + \varepsilon)|K^0\rangle - (1 - \varepsilon)|\bar K^0\rangle\right]. \tag{15.25}$$

Each state has its own time development given by $e^{-iE_{S,L}t}$. The subscripts S and
L indicate the short- and long-lived states. When $\varepsilon = 0$, $|K_S\rangle$ and $|K_L\rangle$ are even
and odd eigenstates of the CP operator. For ${\rm Re}\,\varepsilon \neq 0$ the states are no longer CP
eigenstates, indicating that the symmetry is broken in the construction of the states.

There are two special properties we wish to discuss. The states are not orthogonal
to each other, but have an overlap

$$\langle K_S|K_L\rangle = \frac{2\,{\rm Re}\,\varepsilon}{1 + |\varepsilon|^2}. \tag{15.26}$$

This follows from the fact that the mass matrix, in general, is not Hermitian. The
second property occurs when $\varepsilon$ is purely imaginary. We can use the states $|K^0\rangle$ and
$|\bar K^0\rangle$ or new states

$$|K^0\rangle' = e^{i\alpha}|K^0\rangle \quad\text{and}\quad |\bar K^0\rangle' = e^{-i\alpha}|\bar K^0\rangle \tag{15.27}$$

rotated by a constant phase $\alpha$. A purely imaginary $\varepsilon$ can be eliminated by the
appropriate redefinition of the states in Eqs. (15.24) and (15.25). The real part of
$\varepsilon$ cannot be eliminated. A real part of $\varepsilon_K$ has been established for the $K^0$-meson
system. For $B^0$ mesons $\varepsilon_B$ is, to a good approximation, purely imaginary and there
is no CP violation in the construction of the physical states.

For the time development of the states we separate the eigenvalues into their
respective dispersive and absorptive parts,

$$M_{S,L} - \frac{i}{2}\Gamma_{S,L} = E_{S,L} = E_{1,2},$$

which define the mass and width differences

$$M_L - M_S - \frac{i}{2}(\Gamma_L - \Gamma_S) = 2\sqrt{H_{12}H_{21}}. \tag{15.28}$$

A general state is a superposition of the physical states $K_S$ and $K_L$, with constant
coefficients $C_{1,2}$ describing how the state was created at time $t = 0$:

$$\psi(t) = C_1e^{-i[M_S - (i/2)\Gamma_S]t}|K_S\rangle + C_2e^{-i[M_L - (i/2)\Gamma_L]t}|K_L\rangle. \tag{15.29}$$

The decay of the state proceeds through strangeness-changing couplings, which
requires that we rewrite them in terms of $|K^0\rangle$ and $|\bar K^0\rangle$. The time development of
a state that at time $t = 0$ began as $|K^0\rangle$ is given by

$$\psi(t) = N\left[f_+(t)|K^0\rangle + \frac{q}{p}f_-(t)|\bar K^0\rangle\right], \tag{15.30}$$

with

$$f_\pm(t) = e^{-i[M_S - (i/2)\Gamma_S]t} \pm e^{-i[M_L - (i/2)\Gamma_L]t} \tag{15.31}$$

and $N$ a normalization constant. Similarly, a state that starts at $t = 0$ as $|\bar K^0\rangle$ has
the time development

$$\bar\psi(t) = N'\left[\frac{p}{q}f_-(t)|K^0\rangle + f_+(t)|\bar K^0\rangle\right]. \tag{15.32}$$


These equations indicate that a state that started as a pure $|K^0\rangle$ will develop in time a
$|\bar K^0\rangle$ component through the interference of the two terms. The fact that it involves
an interference phenomenon makes possible the separation of the amplitudes, as
well as determination of the factor $q/p$.

Let us consider an experiment in which a state $|K^0\rangle$ was created. This state will
evolve into a mixture of both $K^0$ and $\bar K^0$. In fact the probabilities of finding at time
$t$ the $|K^0\rangle$ and $|\bar K^0\rangle$ components are

$$|\langle K^0|K^0(t)\rangle|^2 = \frac14|f_+(t)|^2 = \frac14\left[e^{-\Gamma_1t} + e^{-\Gamma_2t} + 2\cos(\Delta M\,t)\,e^{-\Gamma t}\right]$$

and

$$|\langle \bar K^0|K^0(t)\rangle|^2 = \frac14\left|\frac{q}{p}\right|^2|f_-(t)|^2 = \frac14\left|\frac{q}{p}\right|^2\left[e^{-\Gamma_1t} + e^{-\Gamma_2t} - 2\cos(\Delta M\,t)\,e^{-\Gamma t}\right],$$

with $\Gamma_1 = \Gamma_S$, $\Gamma_2 = \Gamma_L$, and $\Gamma = \frac12(\Gamma_S + \Gamma_L)$. Similar formulas hold for a state
that starts as $|\bar K^0\rangle$. The detection of $|K^0\rangle$ or $|\bar K^0\rangle$ in the final state is carried out
by observing their decay products. Thus the final formulas involve an additional
amplitude, which introduces its own phase. The time development of the states
allows the accurate determination of $\Delta M$ and of relative phases. In fact, this property
is used heavily in the analysis of experiments.
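The oscillation pattern is easy to tabulate. The sketch below evaluates the probabilities of finding the $K^0$ and the $\bar K^0$ component in a state that started as a pure $K^0$, both from the closed formulas (a decaying part plus or minus an oscillating interference term) and directly from the amplitudes $f_\pm(t)$. The mass and width differences are the kaon values, $\Delta M = 3.52\times10^{-15}$ GeV and $\Delta\Gamma = -7.36\times10^{-15}$ GeV; the $K_L$ width, the value of $\hbar$, the default $|q/p| = 1$, and the function names are assumptions made for the illustration.

```python
import cmath
import math

HBAR = 6.582e-25          # GeV s, converts seconds into GeV^-1
DELTA_M = 3.52e-15        # GeV, M_L - M_S
DELTA_GAMMA = -7.36e-15   # GeV, Gamma_L - Gamma_S
GAMMA_L = 1.29e-17        # GeV, assumed K_L width
GAMMA_S = GAMMA_L - DELTA_GAMMA

def f_plus_minus(t):
    """Amplitudes f_+(t) and f_-(t); t in GeV^-1, overall phase fixed by setting M_S = 0."""
    e_s = cmath.exp(-1j * (0.0 - 0.5j * GAMMA_S) * t)
    e_l = cmath.exp(-1j * (DELTA_M - 0.5j * GAMMA_L) * t)
    return e_s + e_l, e_s - e_l

def probabilities(t, q_over_p=1.0):
    """Closed-form probabilities (K0 -> K0, K0 -> K0bar) at time t in GeV^-1."""
    gamma = 0.5 * (GAMMA_S + GAMMA_L)
    decay = math.exp(-GAMMA_S * t) + math.exp(-GAMMA_L * t)
    interference = 2.0 * math.cos(DELTA_M * t) * math.exp(-gamma * t)
    same = 0.25 * (decay + interference)
    flipped = 0.25 * abs(q_over_p) ** 2 * (decay - interference)
    return same, flipped

for t_sec in (0.0, 1.0e-10, 3.0e-10, 1.0e-9):
    p_same, p_flip = probabilities(t_sec / HBAR)
    print(f"t = {t_sec:.1e} s   P(K0) = {p_same:.4f}   P(K0bar) = {p_flip:.4f}")
```

At times long compared with the $K_S$ lifetime the interference term has died out and both probabilities approach $\frac14 e^{-\Gamma_L t}$, i.e. only the $K_L$ component survives.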


15.3.1 Simplified formulas for K? mesons 


Numerous experiments with K-meson beams were able to determine $\Delta M$, $\Delta\Gamma$,
and the parameter $\varepsilon_K$. The results suggest several approximations that simplify the
equations considerably. For neutral K mesons the mass and width differences are
comparable:

$$\Delta M_K = M_L - M_S = 3.52 \times 10^{-15}\ \text{GeV}, \qquad \Delta\Gamma_K = \Gamma_L - \Gamma_S = -7.36 \times 10^{-15}\ \text{GeV}. \tag{15.33}$$

Measurements in the decays of the particles determine $\varepsilon_K$ to be small,

$$|\varepsilon_K| = (2.27 \pm 0.02) \times 10^{-3}, \tag{15.34}$$

with a phase of approximately 45°. At the end of this section we describe an
experimental method that determines $|\varepsilon_K|$.

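For small $\varepsilon_K$ the exponent $\xi$ which occurs in Eq. (15.23) is small and allows the
following approximations:

$$H_{21} \approx \sqrt{H_{12}H_{21}}\,(1 - i\xi), \qquad H_{12} \approx \sqrt{H_{12}H_{21}}\,(1 + i\xi).$$

From the definition of $\varepsilon_K$ it follows that

$$\varepsilon_K = \frac{H_{12} - H_{21}}{H_{12} + H_{21} + 2\sqrt{H_{12}H_{21}}}$$

and, using the above approximations,

$$\varepsilon_K = \frac{i\,{\rm Im}\,M_{12} + \frac12{\rm Im}\,\Gamma_{12}}{2\sqrt{H_{12}H_{21}}} = \frac{i\,{\rm Im}\,M_{12} + \frac12{\rm Im}\,\Gamma_{12}}{\Delta M - (i/2)\Delta\Gamma}. \tag{15.35}$$

The fact that the phase is 45° means that ${\rm Im}\,M_{12}$ and ${\rm Im}\,\Gamma_{12}$ are comparable.
Furthermore, the magnitude of $|\varepsilon_K|$ implies that the denominator is much larger,
i.e.

$${\rm Im}\,M_{12},\ {\rm Im}\,\Gamma_{12} \ll {\rm Re}\,M_{12}\ \text{or}\ {\rm Re}\,\Gamma_{12}, \tag{15.36}$$

giving the final result

$$H_{12} \approx {\rm Re}\,M_{12} - \frac{i}{2}{\rm Re}\,\Gamma_{12} \tag{15.37}$$

and

$$\sqrt{H_{12}H_{21}} \approx {\rm Re}\,M_{12} - \frac{i}{2}{\rm Re}\,\Gamma_{12}. \tag{15.38}$$

The simplified formulas of this section hold only for the $K^0$ system. For this case
the mass difference is

$$\Delta M = 2\,{\rm Re}\,M_{12} \tag{15.39}$$

and the width difference is

$$\Delta\Gamma = 2\,{\rm Re}\,\Gamma_{12}. \tag{15.40}$$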

We shall discuss the theoretical determination of these quantities in the next section.
Before leaving the discussion of the K mesons, we discuss the measurement of
${\rm Re}\,\varepsilon_K$ from semileptonic decays.

Let us consider a beam that consists of $K_L$ mesons. This beam is created in
accelerators by producing intense beams of $K^0$ or $\bar K^0$ mesons and setting up an
experiment far away from the production region, where the $K_S$ particles have already
decayed. Next we consider the decays

$$K_L \to \pi^-\ell^+\nu \tag{15.41}$$

and

$$K_L \to \pi^+\ell^-\bar\nu, \tag{15.42}$$

distinguished by the charges of the pions and leptons. We denote the decay amplitudes
as

$$a_\ell = \langle \pi^-\ell^+\nu|H_W|K^0\rangle, \tag{15.43}$$

$$\bar a_\ell = \langle \pi^+\ell^-\bar\nu|H_W|\bar K^0\rangle. \tag{15.44}$$

Assuming CP invariance for the amplitudes, Theorem 2 says that the two amplitudes
are equal,

$$a_\ell = \bar a_\ell, \tag{15.45}$$

which, together with the definition of $K_L$, determines the asymmetry

$$\delta = \frac{\Gamma(K_L \to \pi^-\ell^+\nu) - \Gamma(K_L \to \pi^+\ell^-\bar\nu)}{\Gamma(K_L \to \pi^-\ell^+\nu) + \Gamma(K_L \to \pi^+\ell^-\bar\nu)} = \frac{2\,{\rm Re}\,\varepsilon}{1 + |\varepsilon|^2}. \tag{15.46}$$

The experimental value for the asymmetry is

$$\delta = (3.27 \pm 0.12) \times 10^{-3}, \tag{15.47}$$

which is consistent with the magnitude and phase given earlier in this section. The
separation into magnitude and phase is obtained by comparing the semileptonic
decay with the $K_L \to \pi\pi$ decays, which we shall describe in a following section.
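The consistency claimed here can be verified with a short calculation. The sketch below takes the magnitude $2.27\times10^{-3}$ and a phase of exactly 45° for $\varepsilon_K$ (the text only says "approximately 45°", so the exact phase is an assumption) and evaluates the asymmetry both from $2\,{\rm Re}\,\varepsilon/(1 + |\varepsilon|^2)$ and from the two rates, which are proportional to $|1 + \varepsilon|^2$ and $|1 - \varepsilon|^2$ when the amplitudes for $K^0$ and $\bar K^0$ are equal. The function names are ours.

```python
import cmath
import math

def charge_asymmetry(eps):
    """Semileptonic charge asymmetry 2 Re(eps) / (1 + |eps|^2)."""
    return 2.0 * eps.real / (1.0 + abs(eps) ** 2)

def asymmetry_from_rates(eps):
    """Same asymmetry built from rates proportional to |1 + eps|^2 and |1 - eps|^2."""
    rate_plus = abs(1.0 + eps) ** 2    # K_L -> pi- l+ nu, reached through the K0 component
    rate_minus = abs(1.0 - eps) ** 2   # K_L -> pi+ l- nubar, reached through the K0bar component
    return (rate_plus - rate_minus) / (rate_plus + rate_minus)

eps_k = cmath.rect(2.27e-3, math.radians(45.0))   # assumed phase of exactly 45 degrees
delta = charge_asymmetry(eps_k)
print(f"delta = {delta:.3e}")
```

The result, about $3.2\times10^{-3}$, lies within the quoted experimental error of the measured asymmetry.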


15.4 The K°-K° transition amplitude 


The theoretical calculations of the matrix elements $M_{12}$ and $\Gamma_{12}$ are far from being
understood. For these as well as other matrix elements, there have been developed
several methods that provide acceptable values and even make successful predictions.
For $K^0$ mesons there are short- and long-distance contributions, with the
dominance of the short-distance contributions being harder to justify, because the
strong coupling constant $\alpha_s(q^2)$ is large and the quarks are confined into hadrons.
For the $B^0$ mesons, on the other hand, the dominance of the top quark in intermediate
states makes short-distance dominance more reliable.

The diagonal elements of the mass matrix are created by the strong interactions.
The off-diagonal term $M_{12}$ involves a $\Delta S = 2$ transition and receives contributions
from the box diagrams, as described in Section 14.7. The method consists of calculating
an effective $\Delta S = 2$ Hamiltonian in the free-quark model and then taking its
matrix element between the $K^0$ and $\bar K^0$ states. The $\Delta S = 2$ Hamiltonian generated
by this method was described in Section 14.7:

$$H_{\rm eff}^{\Delta S=2} = -\frac{G_F^2}{16\pi^2}\, M_W^2\, Q_{\Delta S=2}\left[\lambda_c^2 E(x_c) + 2\lambda_c\lambda_t E(x_c, x_t) + \lambda_t^2 E(x_t)\right], \tag{15.48}$$
with the various terms defined as follows. The variable $x_i = m_i^2/M_W^2$, and the
couplings of the quarks at the various vertices produce the factors

$$\lambda_i = V_{id} V_{is}^*. \tag{15.49}$$

Their numerical values are determined by the CKM-matrix elements, with $\lambda_c$ being
of $O(\lambda)$ and $\lambda_t$ of $O(\lambda^5)$ in the Wolfenstein parametrization. The functions $E(x_i)$
and $E(x_c, x_t)$ are obtained from the integration over the loop. We wrote down $E(x_i)$
in Eq. (14.52) and the second function is given as


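$$E(x_i, x_j) = -x_i x_j \left\{ \frac{1}{x_i - x_j}\left[\frac{1}{4} + \frac{3}{2(1-x_i)} - \frac{3}{4(1-x_i)^2}\right]\ln x_i + (i \leftrightarrow j) - \frac{3}{4(1-x_i)(1-x_j)} \right\}. \tag{15.50}$$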
For $m_c^2 \ll M_W^2$,
$$E(x_c) \to -x_c,$$
and for $m_t^2 \gg M_W^2$,
$$E(x_t) \to -\tfrac{1}{4}\, x_t - \tfrac{3}{2} \ln x_t,$$
which indicates that the various terms in (15.50) are comparable.
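To make the structure of Eq. (15.50) concrete, here is a minimal numerical sketch of the two-flavour box function. The quark and W masses are assumed illustrative values, and the function names are hypothetical; the code only transcribes the formula and is not a substitute for the full expressions of Section 14.7.

```python
import math


def _bracket(x: float) -> float:
    """Square bracket multiplying ln(x) in Eq. (15.50)."""
    return 0.25 + 1.5 / (1.0 - x) - 0.75 / (1.0 - x) ** 2


def box_function(xi: float, xj: float) -> float:
    """Two-flavour box function E(x_i, x_j); requires xi != xj and xi, xj != 1."""
    log_terms = (_bracket(xi) * math.log(xi) - _bracket(xj) * math.log(xj)) / (xi - xj)
    contact = 0.75 / ((1.0 - xi) * (1.0 - xj))
    return -xi * xj * (log_terms - contact)


# Assumed illustrative masses in GeV (not taken from the text).
M_W, M_C, M_T = 80.4, 1.3, 170.0
x_c = (M_C / M_W) ** 2
x_t = (M_T / M_W) ** 2
print(f"x_c = {x_c:.3e}, x_t = {x_t:.3f}, E(x_c, x_t) = {box_function(x_c, x_t):.3e}")
```

The function is symmetric in its arguments, and for two nearly equal small arguments it approaches $-x$, in line with the light-quark limit quoted above.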
Finally, there is the operator

$$Q_{\Delta S=2} = \bar d \gamma_\mu (1 - \gamma_5) s \; \bar d \gamma^\mu (1 - \gamma_5) s,$$

which represents the external lines of the box diagram. The matrix element
$X_K = \langle K^0 | Q_{\Delta S=2} | \bar K^0 \rangle$ contains the long-distance contribution of this
calculation. A good deal of effort has been invested in its calculation. In various
situations so far, we have encountered the calculation of two-quark operators (currents)
between hadronic states, for which there are reliable numerical estimates, sometimes
extracted from experimental data. Estimates of matrix elements for four-quark operators
are less reliable and are still a subject of research. A simple estimate of such matrix
elements is given in Eq. (16.5), which can be taken over for the K mesons by making the
replacements $F_D \to F_K$ and $m_D \to m_K$.

The absorptive part $\Gamma_{12}$ is in principle also calculable in terms of the box diagrams
by setting the intermediate states on the mass shell, i.e. replacing the propagators
by $\delta$-functions. For K mesons the physical intermediate states are u quarks, making
the absorptive part a long-distance effect. This term is calculated by low-energy
methods with the intermediate states being $2\pi$, $3\pi$, ... mesons. The calculation
carries a large uncertainty because the amplitudes and their relative phases are not
known.

The situation is different for heavy mesons, in particular the $B^0$-$\bar B^0$ system, in
which there are many intermediate states with multiparticle final states dominating
the decay. For heavy mesons the sum over intermediate states is replaced by the
quarks and is calculated as the absorptive part of the diagram. This is known
as quark-hadron duality, whereby hadronic matrix elements are replaced by
the corresponding quark diagrams. Finally, the matrix elements are taken between
hadrons, for which approximations are again necessary.

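We close this section by deriving two approximate formulas describing the mass
and width differences of neutral B mesons. The general formulas for width and
mass differences are

$$\Delta M = 2\,{\rm Re}\left[\left(M_{12} - \tfrac{i}{2}\Gamma_{12}\right)\left(M_{12}^* - \tfrac{i}{2}\Gamma_{12}^*\right)\right]^{1/2}, \tag{15.51}$$
$$\Delta\Gamma = -4\,{\rm Im}\left[\left(M_{12} - \tfrac{i}{2}\Gamma_{12}\right)\left(M_{12}^* - \tfrac{i}{2}\Gamma_{12}^*\right)\right]^{1/2}. \tag{15.52}$$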


For these equations, we need assume only that the CPT symmetry is exact. For $K^0$
mesons the data imply the approximations which were described in Section 15.3.1.
For the B mesons the situation is different. Estimates of $\Gamma_{12}$ and $M_{12}$ using the
effective Hamiltonians of this section lead to the estimate (Paschos and Türke,
1989)

$$\Gamma_{12} \lesssim 0.1\, M_{12}$$
and with almost the same phase; consequently for B mesons
$$\Delta M_B = 2|M_{12}| \quad \text{and} \quad \Delta\Gamma_B = 2|\Gamma_{12}| \tag{15.53}$$

to a good approximation. The differences in the masses and widths of the $K^0$ and
$B^0$ mesons indicate that each system must be treated separately. The qualitative
differences are understood in terms of the quark substructure which enters the box
diagrams.
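The step from Eqs. (15.51) and (15.52) to Eq. (15.53) can be checked numerically. The sketch below uses toy inputs in arbitrary units, with $|\Gamma_{12}| = 0.1\,|M_{12}|$ and a common phase; both the numbers and the function name are assumptions made for illustration. The principal branch of the square root fixes the overall sign convention.

```python
import cmath


def mass_width_differences(m12: complex, gamma12: complex):
    """Return (Delta M, Delta Gamma) from Eqs. (15.51) and (15.52)."""
    product = (m12 - 0.5j * gamma12) * (m12.conjugate() - 0.5j * gamma12.conjugate())
    root = cmath.sqrt(product)  # principal branch; sets the overall sign convention
    return 2.0 * root.real, -4.0 * root.imag


# Assumed toy inputs (arbitrary units): common phase, |Gamma12| = 0.1 |M12|.
common_phase = cmath.exp(0.3j)
m12 = 1.0 * common_phase
gamma12 = 0.1 * common_phase

delta_m, delta_gamma = mass_width_differences(m12, gamma12)
print(f"Delta M     = {delta_m:.6f}   vs  2|M12|     = {2 * abs(m12):.6f}")
print(f"Delta Gamma = {delta_gamma:.6f}   vs  2|Gamma12| = {2 * abs(gamma12):.6f}")
```

When the two phases coincide the product under the root is a perfect square, so the approximate relations hold exactly; a relative phase between $M_{12}$ and $\Gamma_{12}$ introduces corrections.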


15.5 CP violation in amplitudes 


Besides the phase introduced in the mass matrix, the decay amplitudes have their
own phases. Theorem 1 states that the amplitudes for particle and antiparticle 
decays are the complex conjugates of each other. This is a consequence of CPT. 
CP symmetry goes one step further and requires the amplitudes to be real relative 
to each other. Consequently, evidence for the breakdown of CP symmetry requires 
the measurement of phases. 

In quantum mechanics the overall phase of a sum of amplitudes can always be 
removed, but relative phases among amplitudes are measurable observables. For 
this reason all measurements must include at least two phases and the experiments 
measure one phase relative to the other. 




Let us denote the final state by $\langle f|$ and in addition select the final state to be a
CP eigenstate with eigenvalue unity. Examples of such decays are

$$K^0 \to \pi^+\pi^-,\ \pi^0\pi^0. \tag{15.54}$$

There are two decay amplitudes specified by the isospin of the two pions being
0 or 2. Searches for direct CP violation measure the relative phase of the two
amplitudes and try to establish whether it is the same in $K^0$ and $\bar K^0$ decays. We
denote the amplitudes

$$\langle (2\pi)_I | H_W | K^0 \rangle = A_I e^{i\delta_I}, \qquad \langle (2\pi)_I | H_W | \bar K^0 \rangle = \bar A_I e^{i\delta_I} \quad \text{with } I = 0 \text{ or } 2. \tag{15.55}$$


The phases $\delta_I$ are created by final-state interactions of the two pions, which is
a strong-interaction effect independent of the initial state but a function of the
isospin $I$. Beyond the strong phase there is also a phase of weak origin, which
changes sign as we go from particles to antiparticles. Consequently, we can write
the amplitudes as follows:

$$A_I = |A_I|\, e^{i\theta_I}, \tag{15.56}$$
$$\bar A_I = |A_I|\, e^{-i\theta_I}, \tag{15.57}$$

where $\theta_I$ is now a phase of weak origin.

Experiments starting with a $|K^0\rangle$ or a $|\bar K^0\rangle$ beam also observed the mixing
phenomenon, described in Section 15.3, in the decays to $\pi^+\pi^-$ and $\pi^0\pi^0$. At a
distance corresponding to six to seven lifetimes of the $K_S$ mesons the two amplitudes
interfere and show a difference. In this way one can separate the ratios

$$\eta_{+-} = \frac{A(K_L \to \pi^+\pi^-)}{A(K_S \to \pi^+\pi^-)} \tag{15.58}$$

and

$$\eta_{00} = \frac{A(K_L \to \pi^0\pi^0)}{A(K_S \to \pi^0\pi^0)}. \tag{15.59}$$


If CP is a good symmetry (the CP quantum number is conserved), then these ratios
vanish. The experiments found these ratios to be different from zero. It is customary
to make an isospin analysis of the amplitudes and write them as

$$A(K^0 \to \pi^0\pi^0) = \sqrt{\tfrac{1}{3}}\, A_0 e^{i\delta_0} - \sqrt{\tfrac{2}{3}}\, A_2 e^{i\delta_2} \tag{15.60}$$
and

$$A(K^0 \to \pi^+\pi^-) = \sqrt{\tfrac{2}{3}}\, A_0 e^{i\delta_0} + \sqrt{\tfrac{1}{3}}\, A_2 e^{i\delta_2} \tag{15.61}$$
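and rewrite the ratios in terms of isospin amplitudes.
Straightforward substitution of the amplitudes gives

$$\eta_{+-} = \frac{\left(\sqrt{2} A_0 + A_2 e^{i\Delta}\right) - e^{i\xi}\left(\sqrt{2} A_0^* + A_2^* e^{i\Delta}\right)}{\left(\sqrt{2} A_0 + A_2 e^{i\Delta}\right) + e^{i\xi}\left(\sqrt{2} A_0^* + A_2^* e^{i\Delta}\right)} \tag{15.62}$$
with $\Delta = \delta_2 - \delta_0$. The ratio is phase-convention independent. A popular
phase convention was introduced by Wu and Yang, which selects the $A_0$ amplitude to be
real; the answers then depend on the phase of the $A_2$ amplitude, denoted by $\theta_2$.
We adopt this convention; then, by substituting $e^{i\xi}$ in terms of $\varepsilon$ and
collecting terms together, we obtain

$$\eta_{+-} = \frac{\varepsilon + \frac{1}{\sqrt{2}}\left|\frac{A_2}{A_0}\right| \cos\theta_2\, e^{i\Delta}\, \varepsilon + \frac{i}{\sqrt{2}}\left|\frac{A_2}{A_0}\right| \sin\theta_2\, e^{i\Delta}}{1 + \frac{1}{\sqrt{2}}\left|\frac{A_2}{A_0}\right| \cos\theta_2\, e^{i\Delta} + \frac{i}{\sqrt{2}}\, \varepsilon \left|\frac{A_2}{A_0}\right| \sin\theta_2\, e^{i\Delta}}. \tag{15.63}$$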


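The expression simplifies if we neglect the last term in the denominator, since
$|\varepsilon|\,|A_2/A_0| \sin\theta_2 \ll 1$. In this case

$$\eta_{+-} = \varepsilon + \frac{\varepsilon'}{1 + \omega/\sqrt{2}} \tag{15.64}$$
with
$$\varepsilon' = \frac{i}{\sqrt{2}} \left|\frac{A_2}{A_0}\right| \sin\theta_2\, e^{i\Delta}$$
and
$$\omega = \left|\frac{A_2}{A_0}\right| \cos\theta_2\, e^{i\Delta}.$$

On repeating the analysis for $\eta_{00}$ with the same approximations, we obtain

$$\eta_{00} = \varepsilon - \frac{2\varepsilon'}{1 - \sqrt{2}\,\omega}. \tag{15.65}$$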

In many models both $A_0$ and $A_2$ are complex and rephasing of the amplitudes is
necessary in order to bring them into accord with the Wu-Yang phase convention.
In summary, in addition to the CP parameter $\varepsilon$ discussed in Section 15.3.1, there
is the parameter $\varepsilon'$. The parameter $\varepsilon$ arose from phases in the mass matrix and
$\varepsilon'$ from relative phases in the decay amplitudes. The former is referred to as indirect
CP violation and the latter as direct CP violation. It is also customary to define the
ratio $\varepsilon'/\varepsilon$ because the phase $\pi/2 + \Delta \approx 45^\circ$ of $\varepsilon'$ is approximately equal to the
phase of $\varepsilon$ and cancels out in the ratio.
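The quality of the approximations (15.64) and (15.65) can be checked numerically against the unexpanded ratio (15.62). The sketch below works in the Wu-Yang convention with $A_0 = 1$. It assumes that the mixing factor $e^{i\xi}$ equals $(1-\varepsilon)/(1+\varepsilon)$, which is the relation that reproduces Eq. (15.63); all numerical inputs and function names are illustrative assumptions.

```python
import cmath
import math

SQRT2 = math.sqrt(2.0)


def eta_exact(eps, ratio, theta2, delta, channel):
    """Unexpanded ratio of Eq. (15.62) ('+-') or its pi0 pi0 analogue ('00').

    Wu-Yang convention: A_0 = 1 (real), A_2 = ratio * exp(i * theta2).
    Assumption: the mixing factor e^{i xi} equals (1 - eps) / (1 + eps).
    """
    a2 = ratio * cmath.exp(1j * theta2)
    strong = cmath.exp(1j * delta)  # e^{i Delta}, Delta = delta_2 - delta_0
    mixing = (1.0 - eps) / (1.0 + eps)
    if channel == "+-":
        amp = SQRT2 + a2 * strong
        amp_bar = SQRT2 + a2.conjugate() * strong
    elif channel == "00":
        amp = 1.0 - SQRT2 * a2 * strong
        amp_bar = 1.0 - SQRT2 * a2.conjugate() * strong
    else:
        raise ValueError("channel must be '+-' or '00'")
    return (amp - mixing * amp_bar) / (amp + mixing * amp_bar)


def eta_approx(eps, ratio, theta2, delta, channel):
    """Approximate forms, Eqs. (15.64) and (15.65)."""
    eps_prime = 1j / SQRT2 * ratio * math.sin(theta2) * cmath.exp(1j * delta)
    omega = ratio * math.cos(theta2) * cmath.exp(1j * delta)
    if channel == "+-":
        return eps + eps_prime / (1.0 + omega / SQRT2)
    return eps - 2.0 * eps_prime / (1.0 - SQRT2 * omega)


# Assumed illustrative inputs (not taken from the text).
EPS = cmath.rect(2.28e-3, math.radians(43.5))
RATIO = 1.0 / 22.2             # |A_2 / A_0|
THETA2 = 1.0e-4                # weak phase of A_2
DELTA = math.radians(-45.0)    # delta_2 - delta_0

for channel in ("+-", "00"):
    exact = eta_exact(EPS, RATIO, THETA2, DELTA, channel)
    approx = eta_approx(EPS, RATIO, THETA2, DELTA, channel)
    print(channel, abs(exact - approx))
```

The neglected term is suppressed by $|\varepsilon|\,|A_2/A_0|\sin\theta_2$, so for inputs of this size the two forms agree far beyond experimental precision.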

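Going back to a general phase convention whereby both $A_0$ and $A_2$ are complex,
we should replace $\theta_2$ by $\theta_2 - \theta_0$ and the definition of $\varepsilon'/\varepsilon$ becomes

$$\frac{\varepsilon'}{\varepsilon} = \frac{1}{\sqrt{2}\,|\varepsilon|} \left|\frac{A_2}{A_0}\right| (\sin\theta_2 - \sin\theta_0) = -\frac{\omega}{\sqrt{2}\,|\varepsilon|\, {\rm Re}\, A_0} \left( {\rm Im}\, A_0 - \frac{1}{\omega}\, {\rm Im}\, A_2 \right), \tag{15.66}$$
with

$$\omega = \frac{|A_2|}{|A_0|} \approx \frac{{\rm Re}\, A_2}{{\rm Re}\, A_0} \approx \frac{1}{22.2}.$$

The remaining problem is the calculation of the imaginary parts of the amplitudes
in Eq. (15.66) because the real parts are much larger, with their numerical values
known from experiments.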


15.6 The effective Hamiltonian 


K-meson decays involve low-energy interactions mediated by the exchanges of 
hadrons and at least one W boson. It is customary to appeal to the quark-hadron 
duality and replace the hadrons by quarks and gluons. The weak interaction is a 
short-distance phenomenon that is represented by the couplings of the W to quarks. 
This is not the only part of the interaction, because there are strong interactions 
produced by the exchanges of gluons. A complete calculation must include both of 
them. Thus a method has been developed in field theory for this purpose. The method 
consists of summing the leading logarithmic contributions of the diagrams. The final 
result is an effective field theory with the W and the heavy quarks eliminated or, as 
one says, “they have been integrated out.” 

Even if we start with one weak operator at momenta comparable to $M_W$, the
exchange of gluons introduces more operators coming from loop diagrams, like
penguin and box diagrams. The effective Hamiltonian has the form

$$H_{\rm eff} = \sum_{a,b} C_{ab}(M_W, \mu)\, Q_{ab}(\mu), \tag{15.67}$$

with $Q_{ab}(\mu) = \bar q(x) \Gamma_a q(x)\, \bar q(x) \Gamma_b q(x)$, with $\Gamma_a$ and $\Gamma_b$ being matrices in Dirac
space. The constants $C_{ab}(M_W, \mu)$ are the coefficient functions (Wilson coefficients)
obtained from the renormalization of the operators. They depend on a high energy scale,
$M_W$, a low energy scale, $\mu$, and the quarks in the intermediate states. Their general
form is


$$C_{ij}(M_W, \mu) \sim \left[ \ln\left( \frac{M_W^2}{\mu^2} \right) \right]^{\gamma/b}, \tag{15.68}$$


which is obtained by integrating renormalization equations of quantum chromodynamics
(QCD). The exponent $\gamma$ is known as the anomalous dimension and $b$
arises from the running of the coupling constant. The calculation of the coefficients
and their accuracy is a theoretical topic of active research whose study is beyond
the scope of this book (Buchalla et al., 1996).

In order to give a general impression of the results, we present here the effective 
Hamiltonian for K-meson decays. As mentioned already, it depends only on the 
light quarks and contains eight operators: 


$$H_{\rm eff} = \frac{G_F}{\sqrt{2}} \sum_{i=1}^{8} \left[ C_i^u(\mu)\lambda_u + C_i^c(\mu)\lambda_c + C_i^t(\mu)\lambda_t \right] Q_i \quad \text{for } \mu < m_c, \tag{15.69}$$

where $\lambda_q = V_{qd} V_{qs}^*$ are again the couplings from the CKM matrix, the unitarity of
which implies

$$\lambda_u + \lambda_c + \lambda_t = 0 \tag{15.70}$$

and makes possible the elimination of one of them. Once we decide to eliminate
$\lambda_u$, the coefficient functions will appear as differences $C_i^c - C_i^u$ and $C_i^t - C_i^u$. The
substitution makes the coefficient functions less sensitive to the up quark. The
operators which appear are defined as follows:


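$$\begin{aligned}
Q_1 &= 4\, \bar s_L \gamma^\mu d_L\, \bar u_L \gamma_\mu u_L, & Q_2 &= 4\, \bar s_L \gamma^\mu u_L\, \bar u_L \gamma_\mu d_L, \\
Q_3 &= 4 \sum_q \bar s_L \gamma^\mu d_L\, \bar q_L \gamma_\mu q_L, & Q_4 &= 4 \sum_q \bar s_L \gamma^\mu q_L\, \bar q_L \gamma_\mu d_L, \\
Q_5 &= 4 \sum_q \bar s_L \gamma^\mu d_L\, \bar q_R \gamma_\mu q_R, & Q_6 &= -8 \sum_q \bar s_L q_R\, \bar q_R d_L, \\
Q_7 &= 4 \sum_q \tfrac{3}{2} e_q\, \bar s_L \gamma^\mu d_L\, \bar q_R \gamma_\mu q_R, & Q_8 &= -8 \sum_q \tfrac{3}{2} e_q\, \bar s_L q_R\, \bar q_R d_L.
\end{aligned} \tag{15.71}$$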


Operator $Q_2$ is the original charged-current operator and $Q_1$ is generated from box-type
diagrams, where in addition to the W a gluon is being exchanged (Fig. 15.1).

The penguin diagrams in Fig. 15.2 generate $Q_3, \ldots, Q_6$. Finally, penguin diagrams
with the exchange of photons generate $Q_7$ and $Q_8$ (electroweak penguins).
Since the penguin diagrams are important, we present several steps of the calculation
in Section 15.7, where it is also explained how the penguin diagrams generate
the various operators.

Figure 15.1. Tree and box diagrams.

Figure 15.2. A penguin diagram.


Finally, one may also show that not all the operators are independent, since they
satisfy the relation

$$-Q_1 + Q_2 + Q_3 = Q_4.$$

The coefficient functions $C_i(M_W, \mu)$ originate from the short-distance interaction
of QCD and have been calculated at the one-loop, as well as the two-loop, level.
The hadronic matrix elements

$$\langle Q_i(\mu) \rangle_I = \langle (\pi\pi)_I | Q_i(\mu) | K^0 \rangle \tag{15.72}$$


originate from long-distance interactions, since they involve low energies and 
momenta. They represent the low-energy limit of QCD and must be calculated 
by low-energy methods. They have been the subject of various calculations, which 
we shall mention briefly. Within the framework described in this section we can 
outline the calculation of e’/e. 

Among the amplitudes which enter the calculation, ${\rm Re}\, A_0$ and ${\rm Re}\, A_2$ are taken
from experimental data (Devlin and Dickey, 1979):

$${\rm Re}\, A_0 = 0.338 \times 10^{-6}\ {\rm GeV} \quad \text{and} \quad {\rm Re}\, A_2 = 0.015 \times 10^{-6}\ {\rm GeV}.$$
They are much larger than their imaginary parts. The amplitudes ${\rm Im}\, A_0$ and ${\rm Im}\, A_2$
are calculated in terms of the effective Hamiltonian

$${\rm Im}\, A_I = \frac{G_F}{\sqrt{2}} \sum_{i=1}^{8} \left[ (C_i^t - C_i^u)\, {\rm Im}\, \lambda_t + (C_i^c - C_i^u)\, {\rm Im}\, \lambda_c \right] \langle Q_i \rangle_I. \tag{15.73}$$

With a phase convention of the CKM matrix whereby $V_{us}$ and $V_{ud}$ are real, the
unitarity of the CKM matrix provides one additional relation,

$${\rm Im}\, \lambda_t = -{\rm Im}\, \lambda_c = \eta \lambda^5 A^2. \tag{15.74}$$


Substitution of this relation eliminates the u quarks in intermediate states, since the
Wilson coefficients for top and charm quarks are subtracted from each other. This
leads to the result

$$\frac{\varepsilon'}{\varepsilon} = \frac{G_F\, \omega}{2 |\varepsilon|\, {\rm Re}\, A_0}\, {\rm Im}\, \lambda_t \left[ \sum_i y_i(\mu) \langle Q_i(\mu) \rangle_0 - \frac{1}{\omega} \sum_i y_i(\mu) \langle Q_i(\mu) \rangle_2 \right] \tag{15.75}$$

with $y_i(\mu) = C_i^c(\mu) - C_i^t(\mu)$ and $\omega = {\rm Re}\, A_2 / {\rm Re}\, A_0$. The superscripts denote
contributions from top and charm quarks in the intermediate states. The unitarity of the
CKM matrix helps by eliminating the Wilson coefficients of the u quarks and making
the QCD contribution sensitive to the energy scales between $m_t$ and $m_c$, where
the short-distance expansion is acceptable. The Wilson coefficients are available
and have been tabulated (Buchalla et al., 1996).
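The unitarity relations (15.70) and (15.74) are easy to verify numerically. The sketch below builds the CKM matrix in the standard three-angle parametrization, in which $V_{ud}$ and $V_{us}$ are real, and evaluates $\lambda_q = V_{qd}V_{qs}^*$. The mixing angles and the phase are assumed round numbers for illustration only, and the helper names are hypothetical.

```python
import cmath
import math


def ckm_standard(s12, s23, s13, delta):
    """CKM matrix (rows u, c, t; columns d, s, b) in the standard parametrization."""
    c12, c23, c13 = (math.sqrt(1.0 - s * s) for s in (s12, s23, s13))
    phase = cmath.exp(1j * delta)
    return [
        [c12 * c13, s12 * c13, s13 / phase],
        [-s12 * c23 - c12 * s23 * s13 * phase, c12 * c23 - s12 * s23 * s13 * phase, s23 * c13],
        [s12 * s23 - c12 * c23 * s13 * phase, -c12 * s23 - s12 * c23 * s13 * phase, c23 * c13],
    ]


def lambda_factors(v):
    """lambda_q = V_qd * conj(V_qs) for q = u, c, t."""
    return {q: complex(row[0]) * complex(row[1]).conjugate() for q, row in zip("uct", v)}


# Assumed illustrative inputs: sines of the mixing angles and the CP phase (radians).
S12, S23, S13, DELTA_CKM = 0.225, 0.041, 0.0036, 1.2

lam = lambda_factors(ckm_standard(S12, S23, S13, DELTA_CKM))
print("sum of lambdas :", abs(sum(lam.values())))
print("Im lambda_c    :", lam["c"].imag)
print("Im lambda_t    :", lam["t"].imag)
```

In this parametrization ${\rm Im}\,\lambda_t = c_{23}\, s_{23}\, s_{13} \sin\delta$, which for inputs of this size is of order $10^{-4}$.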

The hadronic matrix elements have been the subject of numerous calculations.
From the early estimates it was evident that $\langle Q_6 \rangle_0$ plays an important role. The
matrix element is generated by the penguin diagrams and, since it involves pseudoscalar
densities, it is enhanced. In chiral perturbation theory it is expressed in terms
of coupling constants divided by the mass of the strange quark. It was also calculated
by vacuum saturation or the tree contribution of the chiral perturbation theory. It
was noted that the lowest-order contribution must be supplemented by chiral loops
(Bardeen et al., 1987, 1998). The final results indicate that $\langle Q_6 \rangle_0$ is important,
especially because it is further enhanced by contributions from chiral loops.

An additional complication arises from the matrix element $\langle Q_8 \rangle_2$, whereby in
the penguin diagrams the gluon is replaced by a photon. It was argued that the
electroweak term can be very important because it is multiplied by a large factor, $1/\omega$.
Calculations in chiral perturbation theory indicate that its contribution is moderate
and it is further reduced by loops. A good approximation consists of keeping only the
dominant terms $\langle Q_6 \rangle_0$ and $\langle Q_8 \rangle_2$. As an illustrative example we give typical values for
the Wilson coefficients,

$$y_6 = -0.110 \quad \text{and} \quad y_8 = 1.15 \times 10^{-3}, \tag{15.76}$$
and the matrix elements,
$$\langle Q_6 \rangle_0 = -3.4\ {\rm GeV}^3 \quad \text{and} \quad \langle Q_8 \rangle_2 = 0.46\ {\rm GeV}^3, \tag{15.77}$$

obtained in chiral perturbation theory for $m_s = 150$ MeV. The CKM term is precisely
known to be

$${\rm Im}\, \lambda_t = (1.35 \pm 0.35) \times 10^{-4}. \tag{15.78}$$


On collecting all terms together in Eq. (15.75), we obtain the value

$$\left( \frac{\varepsilon'}{\varepsilon} \right)_K = 16.9 \times 10^{-4}, \tag{15.79}$$

which is consistent with the experimental values. The contribution of the electroweak
penguin is less than 20%. Many quantities entering the calculation carry
uncertainties and the final range for the ratio is larger, in the range $(10$-$20) \times 10^{-4}$
(Hambye et al., 2000). Calculations in the chiral quark model give a similar range.
These values are consistent with the newest experimental values,

$$\left( \frac{\varepsilon'}{\varepsilon} \right)_K = (\ldots) \times 10^{-4} \quad \text{NA48 (Fanti et al., 1999)}, \tag{15.80}$$

$$\left( \frac{\varepsilon'}{\varepsilon} \right)_K = (22.7 \pm 2.8) \times 10^{-4} \quad \text{KTeV (Alavi-Harati et al., 1999)}. \tag{15.81}$$

At this time it seems that the experiments are more precise than the theory. This
is the outcome of four large experiments that invested great efforts in precisely
measuring decays and interference phenomena in K-meson decays.

It would be an omission not to mention a good deal of work done on lattice gauge 
theories, which tries to determine the matrix elements. Unfortunately, their results 
are not stable enough yet. They give a wide range of values for the matrix elements 
and the CP parameter. 

This is an introduction to the calculations of the CP parameter intended for 
students who may use it as a guide to the published articles. The bottom line is 
that theoretical analyses in the standard model are consistent with experimental 
measurements. The CKM paradigm gives a consistent — albeit not very accurate — 
picture for the K-meson decays and it remains to find out whether it continues being 
successful for mesons containing heavy quarks. 


15.7 Calculation of a penguin diagram 


In K-meson and B-meson decays an important contribution comes from the penguin
diagram. We have mentioned already that in Eq. (15.71) the penguin diagram with
gluonic corrections produces four operators. It is worthwhile to give several steps
of the calculation, which demonstrate how the various operators are generated. This
section contains long algebraic manipulations and is presented here for those who
are theoretically inclined.

Figure 15.3. Momentum assignments for the penguin diagram.

The notation for the penguin diagram is introduced in Fig. 15.3. The external
momenta are those of the strange and down quarks, which correspond to typical
momenta within light mesons and are small relative to the mass of the W boson. For
this reason external momenta are kept in the spinors but will be neglected within the
four-dimensional integral. A peculiarity of the penguin diagram is the presence of
the gluon propagator with momentum $q$, which is kept throughout the calculation.
Following standard rules, the matrix element is written in the form

$$M_P = -i\, \frac{g_W^2 g_s^2}{8} \int \frac{d^4k}{(2\pi)^4}\; \bar s\, \gamma_\mu\, \frac{\slashed{k} + \slashed{q} + m_i}{(k+q)^2 - m_i^2}\, \gamma_\nu \frac{\lambda^a}{2}\, \frac{\slashed{k} + m_i}{k^2 - m_i^2}\, \gamma^\mu \gamma_-\, d\; \frac{1}{k^2 - M_W^2}\; \frac{1}{q^2}\; \bar Q \gamma^\nu \frac{\lambda^a}{2} Q\; V_{is}^* V_{id}. \tag{15.82}$$
The quarks in the loop and their masses are denoted by the subscript $i$ and $m_i$,
respectively. The index for the intermediate quarks occurs also in the CKM matrix
elements $V_{is}$ and $V_{id}$. The rest of the notation is standard, with $g_W$ and $g_s$ the
weak and strong coupling constants, respectively, $\gamma_- = (1 - \gamma_5)$, and $\lambda^a$ the color
matrices.
We follow several of the steps for the calculation of loops described in
Section 14.7.2. We rewrite the matrix element as

$$M_P = -i\, \frac{g_W^2 g_s^2}{8 q^2}\; \bar s\, \gamma_\mu \gamma^\alpha \gamma_\nu \gamma^\beta \gamma^\mu \gamma_- \frac{\lambda^a}{2}\, d\; \bar Q \gamma^\nu \frac{\lambda^a}{2} Q\; I_{\alpha\beta}(m_i, q), \tag{15.83}$$
with
$$I_{\alpha\beta}(m_i, q) = \int \frac{d^4k}{(2\pi)^4}\, \frac{(k+q)_\alpha k_\beta}{[(k+q)^2 - m_i^2](k^2 - m_i^2)(k^2 - M_W^2)}\; V_{is}^* V_{id}. \tag{15.84}$$
In this way we have separated the spinor structure from the four-dimensional
integral. We ignored fermion masses in the numerator, since they are small relative
to the integration momenta, and set $\bar s \slashed{q} d = 0$ because we consider masses for
external quarks that are small. There is a logarithmic divergence independent of
the quark masses. It is multiplied by $\sum_i V_{is}^* V_{id} = 0$ and vanishes. Similarly, in the
limit in which the gluon momentum transfer $q^2$ is much larger than all internal
quark masses, it is easy to check that the penguin diagram vanishes. However, for
problems involving $q^2 \ll m_c^2$ or $m_t^2$ the cancellation is not complete.

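The remaining two integrals appear with the factors $k_\alpha k_\beta$ or $q_\alpha k_\beta$ in the
numerator. Using the method of Feynman parameters and the integrations described in
Problem 4, the dominant contribution of the integral for $m_c, m_u \ll M_W$ is

$$I_{\alpha\beta}(m_i, q) = -\frac{i\, q_\alpha q_\beta}{16\pi^2 M_W^2}\, \ln\left( \frac{M_W^2}{m_i^2} \right) \left( \frac{1}{3} - \frac{1}{2} \right) V_{is}^* V_{id}, \tag{15.85}$$

with the $\frac{1}{3}$ coming from the $k_\alpha k_\beta$ term and the $\frac{1}{2}$ from the $q_\alpha k_\beta$ term. Finally, we
simplify the spin structure by using known identities:

$$q^\alpha q^\beta\, \bar s \gamma_\beta \gamma_\nu \gamma_\alpha \gamma_- d = \bar s (2 q_\nu - \gamma_\nu \slashed{q}) \slashed{q} \gamma_- d = -q^2\, \bar s \gamma_\nu \gamma_- d. \tag{15.86}$$

The $q^2$ factor cancels out the gluon propagator in Eq. (15.82).
On collecting terms together, we arrive at the final result
$$M_P = -\frac{G_F}{\sqrt{2}}\, \frac{\alpha_s}{12\pi}\, \ln\left( \frac{m_i^2}{M_W^2} \right) \bar s \gamma_\mu \gamma_- \frac{\lambda^a}{2} d\; \bar Q \gamma^\mu \frac{\lambda^a}{2} Q\; (V_{is}^* V_{id}). \tag{15.87}$$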
We note that the coupling $\bar s_L \ldots d_L$ contains left-handed quarks, in contrast to the
gluon coupling $\bar Q \gamma_\mu Q$, which is a vector. By decomposing the latter into left- and
right-handed couplings, we generate two distinct operators. Finally, using an identity for
the product of color matrices,

$$\sum_a \lambda^a_{ij} \lambda^a_{kl} = 2 \left( \delta_{il} \delta_{jk} - \tfrac{1}{3} \delta_{ij} \delta_{kl} \right), \tag{15.88}$$

we double the number of operators. In the end, the penguin diagram generates four
operators, $Q_3$, $Q_4$, $Q_5$, and $Q_6$, which were absent at the tree level.
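The color identity used in this step can be verified by brute force. The sketch below assumes the standard Gell-Mann normalization, for which the identity reads $\sum_a \lambda^a_{ij}\lambda^a_{kl} = 2(\delta_{il}\delta_{jk} - \tfrac{1}{3}\delta_{ij}\delta_{kl})$, and checks all 81 index combinations; the function name is hypothetical.

```python
import math
from itertools import product

R3 = 1.0 / math.sqrt(3.0)

# The eight Gell-Mann matrices in the standard normalization Tr(l^a l^b) = 2 delta^{ab}.
GELL_MANN = [
    [[0, 1, 0], [1, 0, 0], [0, 0, 0]],
    [[0, -1j, 0], [1j, 0, 0], [0, 0, 0]],
    [[1, 0, 0], [0, -1, 0], [0, 0, 0]],
    [[0, 0, 1], [0, 0, 0], [1, 0, 0]],
    [[0, 0, -1j], [0, 0, 0], [1j, 0, 0]],
    [[0, 0, 0], [0, 0, 1], [0, 1, 0]],
    [[0, 0, 0], [0, 0, -1j], [0, 1j, 0]],
    [[R3, 0, 0], [0, R3, 0], [0, 0, -2 * R3]],
]


def kronecker(a: int, b: int) -> float:
    return 1.0 if a == b else 0.0


def max_identity_violation() -> float:
    """Largest |lhs - rhs| of the color identity over all index choices."""
    worst = 0.0
    for i, j, k, l in product(range(3), repeat=4):
        lhs = sum(m[i][j] * m[k][l] for m in GELL_MANN)
        rhs = 2.0 * (kronecker(i, l) * kronecker(j, k) - kronecker(i, j) * kronecker(k, l) / 3.0)
        worst = max(worst, abs(lhs - rhs))
    return worst


print("largest deviation:", max_identity_violation())
```

The first term on the right-hand side exchanges the color indices of the two currents, which is why the rearrangement produces operators with a different color structure.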

There are two ways to treat the penguin diagram. One of them considers its
contribution as a short-distance operator creating a four-fermion interaction among
the quarks. This would be the case when the top quark dominates a process. The
exchange of additional gluons may still be soft and some sort of summation is
again necessary. A final step is the estimation of the four-quark operator between
hadronic states.

The alternative method considers the four operators generated by the penguin
diagrams as basic operators and sums up higher-order QCD corrections. This is
achieved by considering gluonic corrections to each of the operators $Q_1, \ldots, Q_6$,
which renormalize and in addition mix them; that is, gluonic corrections to
one operator generate several of the others. The problem to be solved is one of
coupled differential equations. The initial conditions are defined at high momenta,
where only $Q_2$ has an initial value and all other operators are zero. Following this
method (Peskin and Schroeder, 1995; Buchalla et al., 1996), one arrives at an
effective Hamiltonian similar to that in Eq. (15.69). The theory is effective because
the additional corrections are proportional to higher powers of $\alpha_s(p^2)$, which for
large momenta become very small.
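A much reduced version of this procedure can be sketched for the two current-current operators alone. The code below uses the standard leading-logarithmic solution for the combinations $C_\pm = C_2 \pm C_1$, quoted from the general QCD literature rather than derived in this section; it ignores the penguin operators and quark thresholds. The values of $\alpha_s$ and the number of flavours are assumed inputs, and the function name is hypothetical.

```python
def current_current_coefficients(alpha_s_high: float, alpha_s_low: float, n_flavours: int):
    """Leading-log Wilson coefficients (C1, C2) of Q1 and Q2 at the low scale.

    Initial condition at the high scale: C2 = 1, C1 = 0 (only Q2 is present).
    The combinations C+ = C2 + C1 and C- = C2 - C1 evolve independently:
        C+ = r ** ( 6 / (33 - 2 f)),   C- = r ** (-12 / (33 - 2 f)),
    with r = alpha_s(high) / alpha_s(low) and f active flavours (assumed fixed).
    """
    ratio = alpha_s_high / alpha_s_low
    c_plus = ratio ** (6.0 / (33.0 - 2.0 * n_flavours))
    c_minus = ratio ** (-12.0 / (33.0 - 2.0 * n_flavours))
    return 0.5 * (c_plus - c_minus), 0.5 * (c_plus + c_minus)


# Assumed illustrative couplings at M_W and at a low scale near 1 GeV.
c1, c2 = current_current_coefficients(alpha_s_high=0.12, alpha_s_low=0.40, n_flavours=4)
print(f"C1 = {c1:.3f}, C2 = {c2:.3f}")
```

Even in this toy version the mixing is visible: $Q_1$ starts with zero coefficient at the high scale and acquires a sizable one at low scales purely through gluon exchange.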


Problems for Chapter 15 


1. Introduce in Eq. (15.62) the Wu-Yang phase convention, then substitute for $e^{i\xi}$ in terms
of $\varepsilon$ and derive Eqs. (15.63) and (15.64).

2. (i) The normalizations that occur in Eqs. (15.30) and (15.32) describe how many $K^0$ or
$\bar K^0$ mesons, respectively, are present at time $t = 0$. Argue that for normalized wave
functions $|K^0\rangle$ and $|\bar K^0\rangle$ they should be $N = N' = \frac{1}{2}$.

(ii) Consider the time development of the state $|K^0\rangle$ and the decays to $\pi^+\pi^-$. Describe
the interference term and find an argument justifying the large interference at a
distance corresponding to six to seven lifetimes of the $K_S$ meson.

3. Show that the operators $Q_1$, $Q_2$, $Q_3$, and $Q_4$ satisfy the relation

$$-Q_1 + Q_2 + Q_3 = Q_4.$$


4. The calculation of the integral in Eq. (15.84) contains in the numerator two terms: one
with $k_\alpha k_\beta$ and the other with $q_\alpha k_\beta$. Write each of the integrals in terms of Feynman
parameters. The four-dimensional integrations are of the form

$$\int \frac{d^4k}{(2\pi)^4}\, \frac{\{k_\alpha,\; k_\alpha k_\beta\}}{(k^2 + 2k \cdot p - \Delta)^3} = \frac{1}{32\pi^2 i} \left\{ \frac{-p_\alpha}{\Delta + p^2 + i\epsilon},\;\; \frac{p_\alpha p_\beta}{\Delta + p^2 + i\epsilon} + \tfrac{1}{2} g_{\alpha\beta} \left[ \ln(\Delta + p^2 + i\epsilon) + \Delta_0 \right] \right\}.$$

The function $\Delta$ contains masses of the quarks, the mass $M_W$, $q^2$, and Feynman parameters.
The constant $\Delta_0$ is cut-off-dependent but independent of quark masses; it disappears
when we sum over the quarks in the loop. The remaining two integrations are elementary.
Arrange the integrations in an appropriate way to extract the leading $\ln(M_W/m_i)$
term and obtain Eq. (15.85).

Comment We described the integrals in the limit $m_i \ll M_W$. For $m_t > M_W$ you can
again study the elementary integrals and obtain a modified logarithmic term.


CP violation: D and B mesons


16.1 Introduction 


In previous chapters we mentioned that the analysis of bound states with heavy 
quarks varies from meson to meson. For heavy mesons the hadronic structure 
becomes simpler and approaches the spectator model with corrections given by 
HQET. Their weak properties, on the other hand, remain distinct because they 
depend on the interplay between CKM couplings and the masses of the mesons. 
For this reason we discuss the mixing and CP violation in D and B mesons in a 
separate chapter. 

The system $D^0$-$\bar D^0$ is quite different because the decay width is much larger than
the mass and width differences, which makes the observation of mixing and of CP 
asymmetries very difficult. In fact, they have not been observed yet. This motivated 
several authors to suggest that the observation of these effects, at a higher level than 
expected, will be an indication of contributions beyond the standard model. 

The observation of these effects in the neutral B mesons was experimentally 
promising because the decay of the b quark to its heavier partner, the top quark, 
is not possible for kinematic reasons. The suppressed decay width is comparable 
to the difference in mass of the $B_d$ mesons. Consequently, the mixing over their
lifetimes is substantial and has been observed. In addition, the mixing of the states 
provides another phase that interferes with phases of decay amplitudes and produces 
oscillations observable in the experiments. These are some of the topics to be 
covered in this chapter. 


16.2 The $D^0$-$\bar D^0$ transition amplitude


The calculation of the mass difference for $K_S$ and $K_L$ mesons in terms of the box
diagrams gave a sizable fraction (>50%) of the observed value. This suggests that
similar calculations for heavier mesons may be more accurate. Indeed, estimates of
$\Delta M$ and $\Delta\Gamma$ for $D^0$ mesons give small values relative to the decay width of these
particles. For this reason, mixing of the neutral D states and CP asymmetries have
not yet been observed. Similar calculations for the B mesons give values consistent
with the data, because box diagrams are dominated by top quarks in the intermediate
states.

As mentioned in Section 14.7, mixing of states depends on the size of the decay
width relative to the value of the off-diagonal matrix elements of the effective
Hamiltonian. We can estimate both of these terms for D mesons. The decay width
in the spectator model is given approximately by

$$\Gamma = \frac{G_F^2 m_c^5}{192\pi^3}\, |V_{cs}|^2. \tag{16.1}$$


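This should be compared with the following matrix element of the effective
Hamiltonian:

$$\langle D^0 | H_{12} | \bar D^0 \rangle = -\frac{G_F^2}{16\pi^2}\, M_W^2 \left[ \xi_s^2 E(x_s) + 2 \xi_s \xi_b E(x_s, x_b) + \xi_b^2 E(x_b) \right] \times \langle D^0 | \bar c \gamma_\mu (1 - \gamma_5) u\; \bar c \gamma^\mu (1 - \gamma_5) u | \bar D^0 \rangle. \tag{16.2}$$

The factors $\xi_s = V_{us} V_{cs}^*$ and $\xi_b = V_{ub} V_{cb}^*$ are given in terms of Kobayashi-Maskawa
matrix elements, which in the Wolfenstein parametrization are of order $\lambda$ and $\lambda^5$,
respectively. The $E(x_s)$ and $E(x_b)$ pertain to strange and bottom quarks and are
approximated for $m_i^2 \ll M_W^2$ by

$$E(x_i) \approx -x_i. \tag{16.3}$$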
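The reduced matrix element
$$X_D = \langle D | \bar c_\alpha \gamma_\mu (1 - \gamma_5) u_\alpha\; \bar c_\beta \gamma^\mu (1 - \gamma_5) u_\beta | \bar D \rangle, \tag{16.4}$$

with $\alpha$ and $\beta$ color indices, is similar to the matrix element encountered for K mesons
in Section 15.4. An order-of-magnitude estimate is given by the vacuum-insertion
approximation, which consists of introducing the vacuum state in all possible ways.
For the above case we obtain

$$\begin{aligned}
X_D &= \langle D | \bar c_\alpha \gamma_\mu (1 - \gamma_5) u_\alpha | 0 \rangle \langle 0 | \bar c_\beta \gamma^\mu (1 - \gamma_5) u_\beta | \bar D \rangle + \langle D | \bar c_\alpha \gamma_\mu (1 - \gamma_5) u_\beta | 0 \rangle \langle 0 | \bar c_\beta \gamma^\mu (1 - \gamma_5) u_\alpha | \bar D \rangle \\
&= \left( 1 + \tfrac{1}{3} \right) \left| \langle D | \bar c_\alpha \gamma_\mu (1 - \gamma_5) u_\alpha | 0 \rangle \right|^2 \\
&= \tfrac{8}{3}\, F_D^2\, m_D.
\end{aligned} \tag{16.5}$$

The second equation is obtained by Fierzing the second and the fourth spinors
and the factor of $\frac{1}{3}$ by transforming the matrix elements to color singlets (use
Eq. (15.88)). The decay coupling constant is defined as

$$\langle 0 | \bar c_L \gamma_\mu u_L | D \rangle = \frac{F_D\, p_\mu}{\sqrt{2 m_D}} \tag{16.6}$$

with $u_L$ being a normalized left-handed spinor.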

It is straightforward to estimate ratios of the mass and width differences to the
width. The b-quark exchange graphs are smaller than that for the strange-quark
exchange, because of the very small b coupling. Estimates of the strange-quark
graph give

$$x = \frac{\Delta M}{\Gamma} \approx O(10^{-4}) \quad \text{and} \quad y = \frac{\Delta\Gamma}{2\Gamma} \approx O(10^{-4}), \tag{16.7}$$

with more precise estimates depending on values for the parameters. Alternatively,
one may use hadronic intermediate states, but very few amplitudes are known at
present and their phases are unknown. The above values of $x$ and $y$ give very small
mixing.

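Since the lifetime of D mesons is very short, only time-integrated effects can be
observed. They are defined in terms of the transition probabilities $|\langle D^0 | D^0(t) \rangle|^2$ and
$|\langle \bar D^0 | D^0(t) \rangle|^2$, whose functional forms in terms of widths and masses are similar
to those for the K mesons given at the end of Section 15.3:

$$r = \frac{\int_0^\infty |\langle \bar D^0 | D^0(t) \rangle|^2\, dt}{\int_0^\infty |\langle D^0 | D^0(t) \rangle|^2\, dt} = \left| \frac{q}{p} \right|^2 \frac{\int_0^\infty |f_-(t)|^2\, dt}{\int_0^\infty |f_+(t)|^2\, dt} = \left| \frac{q}{p} \right|^2 \frac{(\Delta M)^2 + (\Delta\Gamma/2)^2}{2\Gamma^2 + (\Delta M)^2 - (\Delta\Gamma/2)^2} = \left| \frac{q}{p} \right|^2 \frac{x^2 + y^2}{2 + x^2 - y^2}. \tag{16.8}$$

These equations imply that the expected mixing for neutral D mesons for the values
in (16.7) will be of order $10^{-8}$ and any observable mixing must be attributed to
another mechanism beyond the standard model.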
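The closed form in Eq. (16.8) can be checked against a direct numerical integration. The sketch below works in units of $1/\Gamma$, with $\Gamma_{1,2} = \Gamma(1 \pm y)$ and $|f_\pm(t)|^2 = e^{-\Gamma_1 t} + e^{-\Gamma_2 t} \pm 2 e^{-\Gamma t}\cos(\Delta M\, t)$. The sample values of $x$ and $y$ and the function names are assumptions made for illustration.

```python
import math


def mixing_ratio_closed_form(x: float, y: float, q_over_p_sq: float = 1.0) -> float:
    """Time-integrated ratio r of Eq. (16.8) with x = dM/Gamma, y = dGamma/(2 Gamma)."""
    return q_over_p_sq * (x * x + y * y) / (2.0 + x * x - y * y)


def mixing_ratio_numeric(x: float, y: float, q_over_p_sq: float = 1.0,
                         t_max: float = 60.0, steps: int = 200_000) -> float:
    """Same ratio from trapezoidal integration of |f_-|^2 and |f_+|^2 (t in units of 1/Gamma)."""

    def f_squared(t: float, sign: float) -> float:
        return (math.exp(-(1.0 + y) * t) + math.exp(-(1.0 - y) * t)
                + sign * 2.0 * math.exp(-t) * math.cos(x * t))

    h = t_max / steps
    numerator = denominator = 0.0
    for n in range(steps + 1):
        weight = 0.5 if n in (0, steps) else 1.0
        t = n * h
        numerator += weight * f_squared(t, -1.0)
        denominator += weight * f_squared(t, +1.0)
    return q_over_p_sq * numerator / denominator  # the step size cancels in the ratio


# Assumed sample values: a B-like case, and the D-meson estimate of Eq. (16.7).
print("x = 0.73, y = 0.1 :", mixing_ratio_closed_form(0.73, 0.1), mixing_ratio_numeric(0.73, 0.1))
print("x = y = 1e-4      :", mixing_ratio_closed_form(1.0e-4, 1.0e-4))
```

For $x$ and $y$ of order $10^{-4}$ the ratio reduces to $(x^2 + y^2)/2$, which reproduces the $10^{-8}$ estimate quoted above.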

Even though the estimates for $D^0$ are very approximate, the methods we described
in this section are general and can be taken over for other mesons. The vacuum-insertion
approximation of Eq. (16.5) has already been used for K mesons and
we will meet it again in the next section. Vacuum insertion is an approximation
that several authors have tried to improve. It is hoped that lattice gauge theories will
eventually give precise values.

The second result of this section, Eq. (16.8), gives the mixing of short-lived states
integrated over long intervals of time. This is a general result that depends on the
quantum-mechanical development of a two-state system. In a tagged $D^0$ beam, the
ratio $r$ gives the number of wrong-sign leptons produced in the decays divided by
the number of right-sign leptons. The wrong-sign leptons are those which originate
from the oscillation of $D^0$ to $\bar D^0$ mesons. We mentioned leptons as an example, but
they can be replaced by decays into K mesons of specific strangeness. Such
a ratio was in fact used to determine $B^0$-$\bar B^0$ mixing, which we shall explain in
Section 16.4.


16.3 Comparison of $K^0$ and $B^0$ mesons


It is instructive at the very beginning to compare several properties of the $K^0$ and
$B^0$ mesons, because these two mesons are quite different. For the K mesons there
are two physical states with very different lifetimes:

$$\tau(K_S) = 89.35\ {\rm ps} \quad \text{and} \quad \tau(K_L) = 51\,700\ {\rm ps}. \tag{16.9}$$


This big difference comes about because the mass and width differences of the K
mesons are comparable:

$$\Delta M_K \approx \tfrac{1}{2}\, |\Delta\Gamma_K| \approx \tfrac{1}{2}\, \Gamma_S. \tag{16.10}$$

For the B mesons the situation is very different. The lifetime of the B mesons is
much shorter:

$$\tau(B) = 1.55 \pm 0.06\ {\rm ps}. \tag{16.11}$$

In addition $\Gamma_{12} \ll M_{12}$ for $B_d$ mesons, which makes the lifetimes of the two physical
states almost identical. For this reason we characterize them by their masses, as
heavy and light, and denote them by $B_H$ and $B_L$, respectively. From the mixing of
the two states we know the ratio

$$\frac{\Delta M}{\Gamma} = 0.73 \pm 0.18. \tag{16.12}$$

The mixing of the B states is described by box diagrams analogous to those in
Fig. 14.4, with the top quark dominating in the intermediate states. Computation
of the diagrams gives the mixing parameter, $\varepsilon_B$, as

$$\frac{q}{p} = \frac{1 - \varepsilon_B}{1 + \varepsilon_B} = \frac{V_{td}^*}{V_{td}} = e^{-2i\xi}, \tag{16.13}$$

with $\xi$ the phase of the $V_{td}$ matrix element; see Eq. (16.27). It follows from this
relation that $\varepsilon_B$ is mostly imaginary with a small real part. Consequently, the leptonic
asymmetry that was useful in K decays is too small, so another method for observing
CP violation has had to be discovered.

Among the interesting phenomena are the mixing of $B_d$ states and CP asymmetries
which have been observed in decays of these mesons. It is mathematically
easier to discuss the CP asymmetry because it deals with the time development
for single states $|B^0(t)\rangle$ and $|\bar B^0(t)\rangle$. The mixing, on the other hand, observes the
correlated development of two states and its calculation is more complicated. Historically,
mixing was observed first and the CP asymmetries were observed only
very recently. We shall follow the historical development and describe first the
mixing of $B^0$ and $\bar B^0$ states. So far we have mentioned $B_d$ states as typical mesons
because experimental results on their decays are available. There are also the $B_s$
mesons, which are very interesting because several of the parameters are different.
We postpone a comparison of $B_s$ and $B_d$ states until Section 16.6.


16.4 Mixing in the $B_d$ system 


The mixing between $B_d$ and $\bar B_d$ mesons was discovered in electron–positron 
collisions in which a $B^0$–$\bar B^0$ pair is produced. As the produced pair develops in time, 
the particles oscillate. The time development of each state separately is given by 
the following equations. A state that starts as $|B^0(t=0)\rangle$ develops according to 

$$ |B^0(t)\rangle = N\left[ f_+(t)\,|B^0\rangle + \frac{q}{p}\,f_-(t)\,|\bar B^0\rangle \right]. \tag{16.14} $$

Similarly, a state that starts as $|\bar B^0(t=0)\rangle$ develops as 

$$ |\bar B^0(t)\rangle = N\left[ \frac{p}{q}\,f_-(t)\,|B^0\rangle + f_+(t)\,|\bar B^0\rangle \right], \tag{16.15} $$
with 

$$ f_\pm(t) = e^{-i\left(M_L - \frac{i}{2}\Gamma_L\right)t} \pm e^{-i\left(M_H - \frac{i}{2}\Gamma_H\right)t} \tag{16.16} $$


and N the normalization factor. To describe the oscillation data properly, we must 
use quantum-mechanical wave functions for a $B^0$–$\bar B^0$ pair. The pair of B mesons 
created at the $\Upsilon(4S)$ resonance is a state with odd charge conjugation, with the two 
mesons flying apart from each other with momenta k and −k. The oscillations that 
set in are highly correlated. The time evolution of the pair is now given by 

$$ |B^0(t), k\rangle\,|\bar B^0(t'), -k\rangle - |\bar B^0(t), k\rangle\,|B^0(t'), -k\rangle. \tag{16.17} $$
It is evident that the decays can take place at different times, with the production of 
leptons through semileptonic decays. For example, we can consider events of the 
type 

$$ e^+e^- \to B^0 + \bar B^0, \qquad B^0 \to X^-\ell^+\nu \ \text{or}\ Y^+\ell^-\bar\nu, \qquad \bar B^0 \to Y^+\ell^-\bar\nu \ \text{or}\ X^-\ell^+\nu. \tag{16.18} $$

Consequently, events are produced in which the pairs of primary leptons emitted are 
$\ell^+\ell^+$, $\ell^+\ell^-$, $\ell^-\ell^+$, and $\ell^-\ell^-$. We denote the corresponding rates by $l^{++}$, $l^{+-}$, $l^{-+}$, 
and $l^{--}$, respectively. The observation of the parameter 

$$ R = \frac{l^{++} + l^{--}}{l^{+-} + l^{-+}} \tag{16.19} $$


characterizes particle–antiparticle mixing. We denote by $A^{--}$ the amplitude that 
the meson with momentum −k decays at time t′ into $\ell^-$ and the other meson 
decays at time t also into $\ell^-$. Using the time dependence of the $|B^0(t)\rangle$ and $|\bar B^0(t)\rangle$ 
states given explicitly in Eqs. (16.14) and (16.15), we obtain for the amplitude $A^{--}$ 

$$ A^{--}(t', t) = \langle \ell^- Y^+|H|\bar B^0, -k\rangle\,\langle \ell^- Y^+|H|\bar B^0, +k\rangle\,\frac{q}{p}\left[ f_-(t)\,f_+(t') - f_+(t)\,f_-(t') \right], \tag{16.20} $$


and, in the same notation, 


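$$ A^{-+}(t', t) = \langle \ell^- Y^+|H|\bar B^0, -k\rangle\,\langle \ell^+ X^-|H|B^0, +k\rangle\left[ f_+(t)\,f_+(t') - f_-(t)\,f_-(t') \right]. \tag{16.21} $$

There are two more equations defining the amplitudes $A^{+-}(t', t)$ and $A^{++}(t', t)$ 
(Paschos and Türke, 1989, p. 218). 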
For the matrix elements, we introduce the abbreviations 

$$ M = \langle \ell^- Y^+|H|\bar B^0, +k\rangle, \qquad \bar M = \langle \ell^+ X^-|H|B^0, +k\rangle, $$

which, according to CPT symmetry, satisfy 

$$ |M| = |\bar M| \equiv \mathcal{M}. \tag{16.22} $$


It is easy to calculate the rates of decay to each pair of charges by squaring the 
amplitudes and integrating over the times t and t′ separately. After some algebra 
and a few integrations, the final answer is 

$$ R = \left|\frac{q}{p}\right|^2 \frac{x^2 + y^2}{2 + x^2 - y^2}, \tag{16.23} $$

with x and y defined for B mesons with $\Delta M = M_H - M_L$ and $\Delta\Gamma = \Gamma_H - \Gamma_L$. It 
is interesting to note that Eq. (16.23) is similar in many respects to Eq. (16.8). 
Experiments measured the ratio in $e^-e^+$ collisions and found 

$$ R = 0.23 \pm 0.09 \pm 0.03. \tag{16.24} $$

For $|q/p| = 1$ and $y \ll 1$, which will be shown later to be an excellent approximation, 

$$ \frac{\Delta M}{\Gamma} = 0.73 \pm 0.18. \tag{16.25} $$

It is now a theoretical problem to calculate AM and AT for the B system and 
compare it with the above values. 
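
The step from the measured ratio R to the mixing parameter x can be sketched numerically. The short Python example below is only an illustration: it assumes the form of Eq. (16.23) with $x = \Delta M/\Gamma$ and $y = \Delta\Gamma/(2\Gamma)$, and the function names are hypothetical. It inverts the relation only in the limit $|q/p| = 1$, $y = 0$, so it does not reproduce the full experimental analysis behind the quoted value and its error.

```python
import math


def like_sign_ratio(x, y=0.0, abs_q_over_p=1.0):
    """Ratio R of like-sign to unlike-sign dilepton rates, as in Eq. (16.23).

    x = Delta M / Gamma and y = Delta Gamma / (2 Gamma) are assumed definitions.
    """
    return abs_q_over_p**2 * (x**2 + y**2) / (2.0 + x**2 - y**2)


def x_from_ratio(r):
    """Invert R = x^2 / (2 + x^2); valid only for |q/p| = 1 and y = 0."""
    return math.sqrt(2.0 * r / (1.0 - r))


if __name__ == "__main__":
    print(f"R for x = 0.73: {like_sign_ratio(0.73):.3f}")
    print(f"x for R = 0.23: {x_from_ratio(0.23):.3f}")
```

Within the quoted uncertainties the two numbers are compatible: a central value R = 0.23 corresponds to x close to 0.77, inside the range 0.73 ± 0.18.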
Figure 16.1. Absorptive parts contributing to the $B^0$–$\bar B^0$ width mixing. 


The mass difference is given by the box diagrams. Estimates similar to those of 
the previous sections indicate that the $B^0$–$\bar B^0$ transition amplitude is dominated by 
the exchange of two top quarks in the box diagram. This is indeed a short-distance 
contribution. Formula (15.48) translates into 


$$ M_{12} = -\frac{G_F^2}{16\pi^2}\,\eta\,M_W^2\,X_B\,E(x_t)\,\xi_t^2, \tag{16.26} $$
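with 

$$ \xi_t = V_{tb}V_{td}^{*} = A\lambda^3(1 - \rho - i\eta), \tag{16.27} $$

$$ X_B = \langle \bar B^0|\,\bar b\gamma_\mu(1-\gamma_5)d\;\bar b\gamma^\mu(1-\gamma_5)d\,|B^0\rangle, \tag{16.28} $$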


and $\eta \approx 0.87$ is a factor originating from QCD corrections. The reduced matrix 
element is parametrized in terms of the vacuum-insertion term times a B factor, 

$$ X_B = \tfrac{8}{3}\left|\langle \bar B^0|\,\bar b\gamma_\mu(1-\gamma_5)d\,|0\rangle\right|^2 B_B = \tfrac{4}{3}\,F_B^2 M_B B_B. \tag{16.29} $$

The factor 8/3 originates from the various terms in the product of the currents and 
rearrangement of the color indices, as was explained in Section 16.2. 

Before we consider the magnitude of the mass difference, it will be useful to 
calculate the width difference computed as the absorptive part of the box diagrams. 

For the calculation of the absorptive part we set the intermediate states on the 
mass shell. This is equivalent to cutting the diagrams in the manner shown in 
Fig. 16.1 and then integrating over the two-body phase space. 

After completion of the integrations and substitution of the reduced matrix 
element from Eq. (16.29), one obtains (Hagelin, 1981) 


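$$ \Gamma_{12} = -\frac{G_F^2 m_b^2}{8\pi}\,F_B^2 M_B B_B\left[ (\xi_u + \xi_c)^2 - \frac{8}{3}\,\frac{m_c^2}{m_b^2}\left(\xi_c^2 + \xi_c\xi_u\right) + \mathcal{O}\!\left(\frac{m_c^4}{m_b^4}\right) \right]. \tag{16.30} $$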


This is a relatively simple formula, which can be compared with $M_{12}$. Neglecting 
terms of order $(m_c/m_b)^2$, we observe that $\Gamma_{12}$ has the same phase as $M_{12}$. This is 
evident from the unitarity of the Kobayashi–Maskawa matrix, which gives 

$$ \xi_u + \xi_c = -\xi_t \tag{16.31} $$

and consequently 

$$ \Gamma_{12} = -\frac{G_F^2 m_b^2}{8\pi}\,F_B^2 M_B B_B\,\xi_t^2. \tag{16.32} $$


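Whenever $M_{12}$ and $\Gamma_{12}$ have the same phase, 

$$ \frac{q}{p} = \left(\frac{M_{12}^{*}}{M_{12}}\right)^{1/2} = \frac{V_{td}}{V_{td}^{*}} = e^{-2i\beta}, \tag{16.33} $$

as was already given in Eq. (16.13). This ratio will be important for the discussion 
of CP violation. 
A second consequence is the magnitude of $\Gamma_{12}$ relative to $M_{12}$: 

$$ \frac{\Gamma_{12}}{M_{12}} \approx \mathcal{O}\!\left(\frac{m_b^2}{m_t^2}\right) \approx 10^{-2}. \tag{16.34} $$

Numerical estimation of the mass difference gives the value $\Delta M \approx 0.73\,\Gamma$, in agreement 
with the mixing of the $B^0$ and $\bar B^0$ states. 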


16.5 Decay rates and CP violation 


B-meson decays are frequently described in terms of quark diagrams, which are 
classified as tree, penguin, or other types of diagrams. The classification is very 
useful since it specifies how the CP phases appear in amplitudes, that is as couplings 
of the CKM-matrix elements. The detailed dynamics are not completely understood 
and we presented several methods for analyzing them in Chapter 14. 

In the spectator model, the simplest diagrams for the decay of a b involve only 
one intermediate $W^\pm$ boson, as shown in Fig. 16.2. 

We denote the decays of the $W^\pm$ boson as $u\bar d$ or $c\bar s$ and we indicate in closed 
ovals the final hadronic states. Thus the diagrams (a) denote the decays 

$$ B_d \to D^- D_s^+ \quad\text{or}\quad B_d \to \pi^+ D^-, \tag{16.35} $$

with the couplings $V_{cb}^{*}V_{cs}$ and $V_{cb}^{*}V_{ud}$, respectively. Similarly, the decays in diagram 
(b) are 

$$ B_d \to J/\psi + K_S \quad\text{or}\quad B_d \to \bar D^0\pi^0, \tag{16.36} $$

with the couplings $V_{cb}^{*}V_{cs}$ and $V_{cb}^{*}V_{ud}$, respectively. 
Figure 16.2. Tree-diagram decays of $B^0$. 
Figure 16.3. A gluonic penguin diagram. 


In addition to the tree diagrams, there are penguin diagrams analogous to the 
ones we discussed in the previous chapter. A typical diagram for gluonic decays is 
shown in Fig. 16.3. 

The final state is determined by the quark assignments. They can produce the 
following states: 


$$ B_d \to J/\psi + K_S, \qquad B_d \to K\pi, \tag{16.37} $$

$$ B_d \to \pi^+\pi^-,\ \pi^0\pi^0. \tag{16.38} $$

We see that the $J/\psi + K_S$ decay can originate from tree and penguin diagrams 
whose dynamics are very different. The dominant penguin diagrams are those with 
charm and top quarks in the intermediate states (see Problem 4) whose couplings 
are 

$$ V_{cb}^{*}V_{cs} = A\lambda^2 + \mathcal{O}(\lambda^4) \tag{16.39} $$

and 

$$ V_{tb}^{*}V_{ts} = -A\lambda^2, \tag{16.40} $$


respectively. This CKM coupling can be extracted as a multiplicative factor. The 
property is unique to this decay channel and makes the predictions very reliable. 
For this reason the decay has been named the gold-plated decay channel. The mode 
has several advantages. 


(i) The prediction for the CP asymmetry is reliably expressed in terms of CKM parameters 
and is large. 
(ii) It has a measurable branching ratio 

$$ \mathrm{Br}(B \to J/\psi K^0) = (8.7 \pm 0.5) \times 10^{-4}, \tag{16.41} $$

with the decays $J/\psi \to \pi^+\pi^-$ and $K^0 \to \pi^+\pi^-$ producing hadrons, which are easily 
detectable. 
(iii) The final state $J/\psi + K_S$ is a CP eigenstate. 


For these reasons the decay of $B^0$ mesons to $J/\psi + K_S$ is an attractive mode and 
we describe the decay amplitudes in Problem 4. 

Fortunately there is also an experimental method for detecting CP-violating 
effects (Carter and Sanda, 1981; Bigi and Sanda, 1981). A $B^0\bar B^0$ pair is produced 
in electron–positron colliders and proceeds to decay. We select a decay mode for 
$B^0$ and $\bar B^0$ to a common final state $|f\rangle$, which is an eigenstate of CP. Let 

$$ A_f = \langle f|H|B^0\rangle \quad\text{and}\quad \bar A_f = \langle f|H|\bar B^0\rangle $$

be the decay amplitudes. Since the beams of particles created in the collider are a 
mixture of $B^0$ and $\bar B^0$ and since their lifetimes are almost identical, we cannot a 
priori tell whether $|f\rangle$ arose from the decay of $B^0$ or $\bar B^0$. Thus we need independent 
information on the flavor of the decaying neutral B meson. This can be achieved 
by observing a semileptonic decay on one side and the decay to $|f\rangle$ on the other 
side. In this way we know whether the decay to $|f\rangle$ originates from a $B^0$ or a $\bar B^0$. The 
experimental groups measure these decays as a function of time. A small asymmetry 
in the time evolution of the two decays is evidence for CP violation. 

We describe the time evolution of particle and antiparticle decays in detail. The 
decay amplitudes as a function of time are 

$$ \langle f|B^0(t)\rangle = \frac{1}{2}\left[ f_+(t)\,A_f + \frac{q}{p}\,f_-(t)\,\bar A_f \right], \tag{16.42} $$

$$ \langle f|\bar B^0(t)\rangle = \frac{1}{2}\left[ \frac{p}{q}\,f_-(t)\,A_f + f_+(t)\,\bar A_f \right]. \tag{16.43} $$

The amplitude $A_f$ is the expectation value of the quark operators between the $|B\rangle$ and 
$\langle f|$ states. It is computed with the help of the quark diagrams described at the beginning 
of this section. The amplitudes have a weak phase coming from the CKM-matrix elements 
and perhaps a phase of strong origin from final-state interactions. We shall denote 
the ratio 

$$ \rho_f = \frac{\bar A_f}{A_f}. \tag{16.44} $$


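In the case of the $B^0$ mesons the factors $f_\pm(t)$ simplify, because 

$$ \Gamma_H \approx \Gamma_L = \Gamma $$

and 

$$ f_\pm(t) = e^{-\Gamma t/2}\,e^{-iMt}\left( e^{i\Delta m\,t/2} \pm e^{-i\Delta m\,t/2} \right), \tag{16.45} $$

with $M = (M_H + M_L)/2$ and $\Delta m = M_H - M_L$. 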


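The rate for detecting a decay to $|f\rangle$ at a time t after the production of a $B^0$ is 
proportional to 

$$ \Gamma(t) \propto \frac{1}{2}\,e^{-\Gamma t}\left[ 1 - S_f \sin(\Delta m\,t) + C_f \cos(\Delta m\,t) \right] \tag{16.46} $$

and, for the detection of a decay from an initial $\bar B^0$, 

$$ \bar\Gamma(t) \propto \frac{1}{2}\,e^{-\Gamma t}\left[ 1 + S_f \sin(\Delta m\,t) - C_f \cos(\Delta m\,t) \right], \tag{16.47} $$

with 

$$ S_f = \frac{2\,\mathrm{Im}\!\left(\frac{q}{p}\rho_f\right)}{1 + \left|\frac{q}{p}\rho_f\right|^2} \quad\text{and}\quad C_f = \frac{1 - \left|\frac{q}{p}\rho_f\right|^2}{1 + \left|\frac{q}{p}\rho_f\right|^2}. \tag{16.48} $$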

In both of these equations there is a time-oscillation superimposed on the expo- 
nential decay. The formulas are very general and can be applied to several decays. 
We consider some decays in detail. 
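
The passage from the time-dependent amplitudes to the oscillating rates can be checked numerically. The Python sketch below is an illustration under stated assumptions, not the author's derivation: it takes $f_\pm(t)$ as the sum and difference of the light and heavy exponentials with a common width, sets $A_f = 1$ and $\bar A_f = \rho_f$, and compares the squared amplitudes with the sine and cosine formula. All function and parameter names are hypothetical.

```python
import cmath
import math


def f_plus_minus(t, dm, gamma, mass=0.0):
    """f_+ and f_- for equal widths; the average mass only sets an overall phase."""
    light = cmath.exp(-1j * (mass - dm / 2) * t - gamma * t / 2)
    heavy = cmath.exp(-1j * (mass + dm / 2) * t - gamma * t / 2)
    return light + heavy, light - heavy


def rates_from_amplitudes(t, q_over_p, rho, dm, gamma):
    """Square the amplitudes of an initial B0 and an initial anti-B0 (A_f = 1)."""
    f_plus, f_minus = f_plus_minus(t, dm, gamma)
    amp_b = 0.5 * (f_plus + q_over_p * f_minus * rho)
    amp_bbar = 0.5 * (f_minus / q_over_p + f_plus * rho)
    return abs(amp_b) ** 2, abs(amp_bbar) ** 2


def rates_from_formula(t, q_over_p, rho, dm, gamma):
    """Same rates from the S_f, C_f form, including the overall normalization."""
    lam = complex(q_over_p * rho)
    norm = 1.0 + abs(lam) ** 2
    s_f = 2.0 * lam.imag / norm
    c_f = (1.0 - abs(lam) ** 2) / norm
    envelope = 0.5 * norm * math.exp(-gamma * t)
    sin_t, cos_t = math.sin(dm * t), math.cos(dm * t)
    rate_b = envelope * (1.0 - s_f * sin_t + c_f * cos_t)
    rate_bbar = envelope * (1.0 + s_f * sin_t - c_f * cos_t) / abs(q_over_p) ** 2
    return rate_b, rate_bbar


if __name__ == "__main__":
    q_over_p = cmath.exp(-2j * 0.41)  # illustrative phase, not a measured value
    for t in (0.0, 1.0, 2.5):
        print(t, rates_from_amplitudes(t, q_over_p, 1.0, 0.5, 0.65),
              rates_from_formula(t, q_over_p, 1.0, 0.5, 0.65))
```

The two computations agree for any choice of the inputs, which shows that the sign of the sine term is tied to the ordering of the light and heavy exponentials in $f_-(t)$.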


(a) The gold-plated channel. For the decay $B \to J/\psi + K_S$ there are tree and penguin 
diagrams. As we discussed in this section, all dominant diagrams have zero CKM phase 
(see Eqs. (16.36) and (16.39)). In addition, $q/p$ is given by Eq. (16.13) and we obtain 

$$ S_f = -\sin(2\beta), \qquad C_f = 0 $$

and 

$$ \Gamma(t) \propto e^{-\Gamma t}\left[ 1 + \sin(2\beta)\sin(\Delta m\,t) \right]. \tag{16.49} $$

Thus the time evolution of the $B^0$ state and its conjugate particle to a common final state 
is an efficient method for identifying CP parameters. The asymmetry 

$$ \frac{\Gamma(t) - \bar\Gamma(t)}{\Gamma(t) + \bar\Gamma(t)} = \sin(2\beta)\sin(\Delta m\,t) \tag{16.50} $$

has a sinusoidal dependence on time. This decay has been measured in the BaBar and 
Belle experiments, giving the average value 

$$ \sin(2\beta) = 0.73 \pm 0.03. \tag{16.51} $$


The angle $\beta$ extracted from the asymmetry is one of the angles in the unitarity triangle. 
It enters the calculation through the element $V_{td}$ given in Eq. (9.53). 

The unitarity triangle is constrained by other measurements as well; to be precise, 
by the magnitude of $V_{ub}$, the parameter $\varepsilon$, and the mixing of $B_d$ and $B_s$ states. Each 
of these quantities determines a region and their intersection defines the apex of the 
triangle. This figure has become very popular and is featured in Fig. 9.2 and in many 
articles (Branco et al., 1999; Kleinknecht, 2003; Brower and Faccini, 2003). The value 
of $\beta$ in Eq. (16.51) has several solutions and one of them coincides with a direction that 
goes through the apex of the triangle. The significance of the $\sin(2\beta)$ measurement is 
that for the first time a large CP asymmetry has been observed, proving that CP is not 
an approximate symmetry of nature. 
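
The statement that a measured value of $\sin(2\beta)$ leaves several solutions for the angle can be made concrete. The following Python sketch (the helper name is hypothetical) lists the four angles between 0° and 360° that share a given $\sin(2\beta)$; which of them passes through the apex of the triangle has to be decided by the other constraints mentioned above.

```python
import math


def beta_solutions_deg(sin_2beta):
    """All angles beta in [0, 360) degrees with the given value of sin(2 beta)."""
    two_beta = math.degrees(math.asin(sin_2beta))
    base = (two_beta / 2.0, 90.0 - two_beta / 2.0)
    # sin(2 beta) is periodic in beta with period 180 degrees
    return sorted((b + shift) % 360.0 for b in base for shift in (0.0, 180.0))


if __name__ == "__main__":
    for beta in beta_solutions_deg(0.73):
        print(f"beta = {beta:7.2f} deg")
```

For the central value 0.73 the smallest solution is close to 23.4°, with a mirror solution near 66.6°.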

(b) Decays to $\pi^+\pi^-$ and $\pi^0\pi^0$. The $B \to \pi\pi$ decays can be analyzed in a similar manner to 
the $K \to \pi\pi$ decays. For the sake of brevity we shall use a similar notation; however, the 
numerical values for the quantities (amplitudes, phases, etc.) in B decays are different. 
The amplitude for $B^0 \to \pi^+\pi^-$ is written as 

$$ A(\pi^+\pi^-) = \sqrt{\tfrac{2}{3}}\,|A_0|\,e^{i(\delta_0+\theta_0)} + \tfrac{1}{\sqrt{3}}\,|A_2|\,e^{i(\delta_2+\theta_2)} \tag{16.52} $$

and 

$$ \bar A(\pi^+\pi^-) = \sqrt{\tfrac{2}{3}}\,|A_0|\,e^{i(\delta_0-\theta_0)} + \tfrac{1}{\sqrt{3}}\,|A_2|\,e^{i(\delta_2-\theta_2)}. \tag{16.53} $$

The phases $\delta_0$ and $\delta_2$ come from strong interactions of the final states, but at this high 
energy they cannot be related to $\pi\pi$ phase shifts. The phases $\theta_0$ and $\theta_2$ come from 
weak interactions, which originate from CKM couplings. The tree diagram contributes 
to both the I = 0 and the I = 2 amplitude and the penguin diagram only to the I = 0 
amplitude. 

The attempt to determine the isospin amplitudes by comparing decays of charged 
and neutral B mesons has met with limited success. This approach is analogous to that 
for the $K \to \pi\pi$ decays, in which, after the isospin analysis of the amplitudes, we had 
to return to the effective QCD Hamiltonian. 

There are also analyses in terms of Feynman diagrams. We have already noted that this 
decay mode receives contributions from tree and penguin diagrams. In the Wolfenstein 
parametrization all diagrams are of order $\lambda^3$. The tree diagram has the phase $e^{-i\gamma}$. The 
penguin diagrams have intermediate states with up, charm, and top quarks, each with a 
different phase. The relevant parameter for this decay has the general form 

$$ \frac{q}{p}\,\frac{\bar A(\pi^+\pi^-)}{A(\pi^+\pi^-)} = e^{-2i\beta}\,\frac{e^{-i\gamma} + h\,e^{i\delta}}{e^{+i\gamma} + h\,e^{i\delta}}, \tag{16.54} $$

with h being the ratio of a term from the penguin diagram to the remaining contributions 
and $\delta$ a phase of strong origin. The interference of mixing with the decay amplitude 
gives 

$$ S_{\pi^+\pi^-} \neq 0 \quad\text{and}\quad C_{\pi^+\pi^-} \neq 0. \tag{16.55} $$

Consequently, the presence of both $\cos(\Delta m\,t)$ and $\sin(\Delta m\,t)$ terms with coefficients 
different from zero is an indication of CP violation in the B amplitudes. For very small 
values of h, this mode fixes 

$$ S_{\pi\pi} = \sin(2\beta + 2\gamma), $$

which, through the triangle relation $\beta + \gamma = \pi - \alpha$, can be replaced by $\alpha$. Thus this 
decay mode can determine $\beta + \gamma$ and, indirectly, the angle $\alpha$. 

(c) Decays to other channels. In the previous discussion we demonstrated that $\beta$ is accurately 
determined and $\gamma$, or alternatively $\alpha$, can be extracted from the $B \to \pi\pi$ decay, provided 
that the hadronic matrix elements are better understood. There are extensive efforts to 
analyze and understand other decay modes. For instance (Fleischer, 2003), 

$$ B \to \phi + K_S $$

has only penguin contributions. An analysis similar to that of the $J/\psi K_S$ mode reveals 
many similarities. In the absence of new physics, the asymmetries for charmonium and 
$\phi$-meson decays should be equal. 

This heralds a new era, in which the theory is expected to be scrutinized through a 
variety of other $B_d$ and $B_s$ decay asymmetries. Impressive progress is being made in 
the search for asymmetries in the aforementioned modes and in addition $B^{0,\pm} \to K\pi$ 
and $B^- \to DK^-$. Current measurements are approaching the stage of cross-checking 
the theory or even discovering physical phenomena beyond the standard model. 


16.6 Mass and lifetime differences for $B_s$ mesons 


The analysis of the previous sections can be extended in a straightforward way to 
the $B_s$ mesons, which present a very interesting mesonic system. Their masses and 
lifetimes are very similar to those of the $B_d$ mesons, but their weak interactions 
are different because the CKM couplings are now much bigger. A consequence is 
the larger mixing in these states and, it is hoped, a lifetime difference between the two 
physical particles that may be measurable. The reduced matrix element is given 
now as 

$$ X_{B_s} = \tfrac{8}{3}\left|\langle \bar B_s|\,\bar b\gamma_\mu(1-\gamma_5)s\,|0\rangle\right|^2 B_{B_s} \tag{16.56} $$

and is expected to have a numerical value close to $X_B$, because the wave functions 
and the general structure of the bound state are expected to be similar. The coupling 
which appears in the box diagrams is 

$$ \xi_t^{s} = V_{tb}V_{ts}^{*} = -A\lambda^2, \tag{16.57} $$
which is much larger than the coupling in Eq. (16.27). The mass difference is given 
by a formula analogous to Eq. (16.26) and, on taking the ratio, one obtains 

$$ \frac{\Delta M_s}{\Delta M_d} = \frac{F_{B_s}^2 M_{B_s}}{F_{B_d}^2 M_{B_d}}\left|\frac{V_{ts}}{V_{td}}\right|^2. \tag{16.58} $$


The ratio of the hadronic matrix parameters is unity in the SU(3) limit or the heavy-quark 
limit. A small deviation from unity may still show up, but this will be of order 
10%–20%. The large change comes from the couplings 

$$ \left|\frac{V_{ts}}{V_{td}}\right|^2 = \frac{1}{\lambda^2\left[(1-\rho)^2 + \eta^2\right]} \approx 20\text{–}30. \tag{16.59} $$

At present there is an experimental bound, $\Delta M_s/\Delta M_d > 21.2$, coming from the 
mixing of $B_s^0$ and $\bar B_s^0$ particles, which is close to 100%. Serious experimental efforts 
are being devoted to searching for a precise value of the mixing, which will also 
determine the ratio $x_s = \Delta M_s/\Gamma_s$. Since the mixing approaches 100%, a precise 
measurement is required in order to extract a value for the mass difference (see 
Eq. (16.23)). 

The estimate which makes $x_s$ large also increases the width difference of the 
two states. The two $B_s$ states can mix to form two distinct eigenstates. To a first 
approximation, 

$$ \frac{\Delta\Gamma_s}{\Delta M_s} \approx \frac{3\pi}{2}\,\frac{m_b^2}{m_t^2} \approx 3.7 \times 10^{-3} \tag{16.60} $$

has been calculated from equations analogous to Eqs. (16.26) and (16.32). The 
width difference can be rescaled to give 

$$ \frac{\Delta\Gamma_s}{\Gamma_s} = \frac{\Delta\Gamma_s}{\Delta M_s}\,\frac{\Delta M_s}{\Delta M_d}\,\frac{\Delta M_d}{\Gamma_d}\,\frac{\Gamma_d}{\Gamma_s}. $$

The ratio of the mass differences was estimated in this section, and assuming $\Gamma_d \approx \Gamma_s$ 
leads to a width difference that is experimentally interesting. Its value is in the range 
7%–14%. Alternative estimates that saturate $\Gamma_{12}$ with physical intermediate states 
or form other ratios also give encouraging results. There is a good chance that the 
lifetime difference $\tau_H - \tau_L$ is 10% of the lifetime of $B_d$ mesons. Such a large value 
would be measurable in the decay 

$$ B_s \to J/\psi + \phi. $$


Problems for Chapter 16 


1. Carry out the integrals $\int_0^\infty |f_\pm(t)|^2\,dt$ and show that the last term in Eq. (16.8) follows. 
Hint: it is easier to integrate exponentials and then take real parts. 

2. Write the matrix element for the mixing of the $B^0$–$\bar B^0$ system, especially the CKM 
couplings of the box diagram with the top quark in the intermediate states. Then prove 
the ratio 

$$ \frac{q}{p} = \left(\frac{M_{12}^{*}}{M_{12}}\right)^{1/2} = \frac{V_{td}}{V_{td}^{*}}. $$


3. Calculate the amplitude for the diagram in Fig. 16.1 (a) with charm and up quarks in the 
intermediate states. Then integrate over the two-body phase space to obtain some of the 
terms in Eq. (16.30). 
4. The decay $B_d \to J/\psi + K_S$ has tree and penguin diagrams. We denote the tree diagram 
by 

$$ T = V_{cb}^{*}V_{cs}\,g $$

and the penguin diagram by 

$$ P = V_{tb}^{*}V_{ts}\,f(m_t) + V_{cb}^{*}V_{cs}\,f(m_c) + V_{ub}^{*}V_{us}\,f(m_u), $$

where $f(m_q)$ are functions from the calculation of the penguin diagram with $m_q$ the 
mass of the q quark in the loop. A naive order-of-magnitude estimate of the tree and 
loop diagrams gives (Gronau, 1992) 

$$ \frac{f(m_t)}{g} \approx \frac{\alpha_s(m_b)}{6\pi}\ln\!\left(\frac{m_t}{m_b}\right) \approx 0.04. $$

Others obtained larger values by computing the penguins with the effective Hamiltonian 
(Kramer and Palmer, 1995). 
(i) Use the unitarity of the CKM matrix to eliminate $V_{tb}^{*}V_{ts}$ and rewrite P in terms of 
the other two CKM factors. 
(ii) Show that the ratio of the two CKM factors is 

$$ \frac{V_{ub}^{*}V_{us}}{V_{cb}^{*}V_{cs}} \approx \lambda^2 + \mathcal{O}(\lambda^4). $$

Prove that the total amplitude is 

$$ \langle J/\psi K_S|H|B\rangle = V_{cb}^{*}V_{cs}\left[ g + \left( f(m_c) - f(m_t) \right) \right], $$

which to order $\lambda^2$ is real. 
(iii) Use this form of the matrix element to calculate the asymmetry in Eq. (16.50). 
5. Analyze the B > + K, decay along the lines of Problem 4 and derive Eq. (16.54) 
with an explicit expression for the function h. 
17 
Higgs particles 


17.1 Higgs-boson couplings 


The electroweak theory depends crucially on the Higgs mechanism. Many aspects 
of the theory have been tested in experiments to a high degree of accuracy, especially 
properties of the gauge bosons and their couplings to fermions. However, the Higgs 
particles have not been discovered yet. In Chapter 7 we discussed the simplest 
and, perhaps, the most natural way of breaking the SU(2) symmetry, namely by 
introducing a doublet of scalar particles. Among them three fields were eliminated, 
becoming the longitudinal degrees of freedom for the gauge bosons. The remaining 
neutral particle is physical and should be observed. In the same chapter it was 
shown that giving the Higgs field a vacuum expectation value generates masses for 
$W^\pm$ and $Z^0$, whose ratio is related to the weak mixing angle. This ratio has been 
confirmed experimentally, which provides strong support for the underlying SU(2) 
symmetry. 
The Higgs doublet contains two complex fields: 

$$ \phi = \begin{pmatrix} \phi^+ \\ \phi^0 \end{pmatrix}. \tag{17.1} $$

The breaking of the symmetry introduces the vacuum expectation value 

$$ \langle \phi^0 \rangle = \frac{v}{\sqrt{2}}. \tag{17.2} $$


The scalar Lagrangian introduced in Chapter 7 has two parameters, $\mu$ and $\lambda$, which 
are related to the value of the field at the minimum: 

$$ v = \frac{\mu}{\sqrt{\lambda}} = \left(\sqrt{2}\,G_F\right)^{-1/2}. $$

The expansion of the field around the minimum of the potential 

$$ \phi = \frac{1}{\sqrt{2}}\begin{pmatrix} 0 \\ v + H \end{pmatrix} \tag{17.3} $$

introduces the physical field H as a fluctuation around the minimum. The numerical 
value of v was determined in Eq. (8.20), leaving the mass of the scalar as the only 
undetermined parameter of the Higgs sector, 

$$ \frac{M_H^2}{2} = \lambda v^2 = \mu^2. \tag{17.4} $$

Consequently, all couplings of the Higgs particles are rewritten in terms of $M_H$ 
and other coupling constants already determined in previous chapters. After a short 
study of the couplings, one finds 

    $H\,W^+_\mu W^-_\nu$ :  $i g M_W\, g_{\mu\nu}$
    $H\,Z_\mu Z_\nu$ :  $\dfrac{i g M_Z}{\cos\theta_W}\, g_{\mu\nu}$
    $H\,H\,H$ :  $-3i\,\dfrac{M_H^2}{v}$
    $H\,f\bar f$ :  $-i\,\dfrac{m_f}{v}$

with 

$$ v = \left(\sqrt{2}\,G_F\right)^{-1/2} = \frac{\mu}{\sqrt{\lambda}} = 246\ \text{GeV}. $$
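
The numerical value of the vacuum expectation value follows directly from the Fermi coupling constant. As a minimal check, assuming the relation $v = (\sqrt{2}\,G_F)^{-1/2}$ and the value $G_F = 1.16637 \times 10^{-5}\ \text{GeV}^{-2}$ (an input not quoted in this chapter), a few lines of Python reproduce the number; the function name is hypothetical.

```python
import math

G_FERMI = 1.16637e-5  # GeV^-2, assumed value of the Fermi coupling constant


def higgs_vev(g_fermi=G_FERMI):
    """Vacuum expectation value v = (sqrt(2) G_F)^(-1/2), in GeV."""
    return (math.sqrt(2.0) * g_fermi) ** -0.5


if __name__ == "__main__":
    print(f"v = {higgs_vev():.2f} GeV")
```

The Yukawa coupling $m_f/v$ of any fermion then follows by dividing its mass by this number.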


With so many couplings depending on a single parameter, we can proceed to 
calculate the decays of Higgses to other particles as well as their production cross 
sections. In this way we find processes in which the Higgs particles must be pro- 
duced, provided that they are not very heavy, or they should appear in the decay 
of ordinary or new heavy particles. All these approaches have been pursued ac- 
tively, but no Higgs particles have been discovered yet. Instead we have bounds on 
their masses and the production rates. We shall cover several of these topics in this 
chapter. 

A very important role in all such studies is played by the Higgs mass. There are 
efforts to limit the mass by appealing to unitarity or demanding that the electroweak 
theory remains renormalizable. An alternative direction of investigation studies the 
effects introduced by radiative corrections to several processes. Higgs particles, 
together with quarks and gauge bosons, appear as intermediate states of Feynman 
diagrams. Many experiments have reached a high level of accuracy so that radia- 
tive corrections must be included in order to bring agreement between theory and 
experiment. They provide a crucial test of the theory and impose constraints on the 
Higgs mass. We study several of them in the next section. 


17.2 Precision tests of the theory 


The electroweak theory introduces several parameters that are undetermined. 
Among them are the mass of the Higgs and the masses and mixing angles of 
the fermions. The gauge couplings and the masses of the gauge bosons enter in 
many reactions, and the values extracted from different reactions must agree. As 
two examples we mention the weak mixing angle $\sin^2\theta_W$ and the Fermi coupling 
constant. They have been measured in various reactions so precisely that higher-order 
corrections are necessary. The loop corrections probe the quantum structure 
of the theory and test it to a higher degree of accuracy. In this section we review 
the precise measurements for several parameters and point out reactions where 
discrepancies may appear on the horizon. At present the agreement is so good that 
we can consider the results from such tests a great success of the theory. 

In order to make predictions, we use three quantities as input parameters. They 
are defined in various processes where they can be determined precisely, including 
radiative corrections, and then they are used to predict other processes. 


(i) The first is the fine-structure constant $\alpha$ measured in experiments that involve the 
Josephson junction or the anomalous magnetic moment of the electron. It receives 
corrections from strong interactions and varies with the momentum at which it is 
measured in a way similar to the strong coupling constant described in Section 11.2. 

(ii) The second parameter is the Fermi coupling constant determined from the muon life- 
time. The corrections include electromagnetic and weak effects. 

(iii) As a third parameter, we can select one of the masses Mw or Mz. It is customary 
to choose Mz determined from the Z lineshape, because it has been measured more 
accurately. 


With these three parameters we can predict other quantities. For instance, 

$$ \sin^2\theta_W \cos^2\theta_W = \frac{\pi\alpha}{\sqrt{2}\,G_F M_Z^2} \tag{17.5} $$

and $M_W = M_Z \cos\theta_W$, which is also rewritten as 

$$ \rho = \frac{M_W^2}{M_Z^2\cos^2\theta_W} = 1. \tag{17.6} $$
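
How the three inputs fix the remaining quantities at tree level can be sketched numerically. The Python example below is only an illustration: it assumes the tree-level relation $\sin^2\theta_W\cos^2\theta_W = \pi\alpha/(\sqrt{2}\,G_F M_Z^2)$ together with $M_W = M_Z\cos\theta_W$, and it uses input values that are not quoted in this section (the low-energy $\alpha = 1/137.036$, $G_F = 1.16637 \times 10^{-5}\ \text{GeV}^{-2}$, $M_Z = 91.1876$ GeV). No radiative corrections are included, so the output differs from the measured values by amounts of the size of those corrections.

```python
import math

ALPHA = 1.0 / 137.036   # assumed low-energy fine-structure constant
G_FERMI = 1.16637e-5    # GeV^-2, assumed
M_Z = 91.1876           # GeV, assumed


def tree_level_sin2_theta_w(alpha=ALPHA, g_fermi=G_FERMI, m_z=M_Z):
    """Solve s^2 (1 - s^2) = pi alpha / (sqrt(2) G_F M_Z^2) for the smaller root."""
    rhs = math.pi * alpha / (math.sqrt(2.0) * g_fermi * m_z**2)
    return 0.5 * (1.0 - math.sqrt(1.0 - 4.0 * rhs))


def tree_level_m_w(alpha=ALPHA, g_fermi=G_FERMI, m_z=M_Z):
    """W mass from M_W = M_Z cos(theta_W), i.e. rho = 1 at tree level."""
    sin2 = tree_level_sin2_theta_w(alpha, g_fermi, m_z)
    return m_z * math.sqrt(1.0 - sin2)


if __name__ == "__main__":
    print(f"sin^2(theta_W) = {tree_level_sin2_theta_w():.4f}")
    print(f"M_W            = {tree_level_m_w():.2f} GeV")
```

With these inputs the sketch gives $\sin^2\theta_W$ near 0.212 and $M_W$ near 80.9 GeV, which illustrates why loop corrections are needed before comparing with data.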


The $\rho$ parameter determines the strength of neutral currents relative to charged 
currents and to lowest order (tree level) it has the value unity. It is modified by corrections, 
as we will mention below. Measuring quantities and carrying out precision 
tests of the theory is a research field in itself. I decided to present here the general 
features that enter such calculations and mention a few comparisons, especially 
those for which there are small disagreements. The reader who wishes to specialize 
in this field can consult the corresponding section in the Particle Data Group review 
(Erler and Langacker, 2004) or books devoted to this topic (Bardin and Passarino, 
1999). 

We have seen in previous chapters that one-loop radiative corrections involve 
integrals of the form 

$$ \int d^4k\,\frac{1}{(k^2 - m^2 + i\varepsilon)^{\alpha}} \tag{17.7} $$

(see, for instance, Sections 14.7 and 15.7). When $\alpha \le 2$ the integral diverges. In 
a renormalizable theory the infinities are absorbed as corrections to masses and 
coupling constants. Since they are arbitrary, it appears, at first glance, that the 
corrections are unobservable. An exception to this rule occurs when masses and 
couplings are not free but are related to each other. As a first example, we mention 
the equality of the vector couplings occurring in muon decay and $\beta$-decay. They 
are used to determine a precise value for $V_{ud}$ as given in Section 9.3.1. 

Figure 17.1. Self-energies for gauge bosons. 

Another accurate example is provided by the $\rho$ parameter in Eq. (17.6). It receives 
corrections from the self-energies shown in Fig. 17.1, where the intermediate solid 
lines represent quarks and the dotted lines Higgs bosons. It is sensitive to the masses 
of particles in the intermediate states and provided a benchmark for restricting the 
masses of these particles. The above-mentioned corrections give rise to deviations 
from unity given by the equation 

$$ \rho = 1 + \frac{3G_F}{8\pi^2\sqrt{2}}\left[ m_t^2 + m_b^2 - \frac{2m_t^2 m_b^2}{m_t^2 - m_b^2}\ln\!\left(\frac{m_t^2}{m_b^2}\right) - \frac{M_W^2\sin^2\theta_W}{\cos^2\theta_W}\ln\!\left(\frac{M_H^2}{M_W^2}\right) \right], \tag{17.8} $$

with $m_t$ and $m_b$ the masses of the top and bottom quarks, respectively, and $M_H$ the 
mass of the Higgs boson. Other quark pairs also contribute, but the magnitude of 
their correction is smaller. 
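
The relative size of the two contributions to the $\rho$ parameter can be estimated with a short numerical sketch. The Python example below assumes the standard one-loop expressions, namely a quark-doublet term proportional to $m_t^2 + m_b^2 - \frac{2m_t^2m_b^2}{m_t^2-m_b^2}\ln(m_t^2/m_b^2)$ and a Higgs term proportional to $-M_W^2\tan^2\theta_W\ln(M_H^2/M_W^2)$, both multiplied by $3G_F/(8\pi^2\sqrt{2})$. The numerical inputs (quark masses, $M_W$, $\sin^2\theta_W$, and a trial Higgs mass) are illustrative assumptions, and the function names are hypothetical.

```python
import math

G_FERMI = 1.16637e-5  # GeV^-2, assumed
PREFACTOR = 3.0 * G_FERMI / (8.0 * math.pi**2 * math.sqrt(2.0))


def delta_rho_doublet(m_up, m_down):
    """Quark-doublet contribution to rho - 1; vanishes for degenerate masses."""
    a, b = m_up**2, m_down**2
    return PREFACTOR * (a + b - 2.0 * a * b / (a - b) * math.log(a / b))


def delta_rho_higgs(m_higgs, m_w=80.4, sin2_theta_w=0.2312):
    """Leading-logarithm Higgs contribution to rho - 1 (negative for M_H > M_W)."""
    tan2 = sin2_theta_w / (1.0 - sin2_theta_w)
    return -PREFACTOR * m_w**2 * tan2 * math.log(m_higgs**2 / m_w**2)


if __name__ == "__main__":
    print(f"top-bottom doublet: {delta_rho_doublet(174.3, 4.5):+.5f}")
    print(f"Higgs, M_H = 126  : {delta_rho_higgs(126.0):+.5f}")
```

The quadratic dependence on $m_t$ against the logarithmic dependence on $M_H$ explains why the top-quark mass was constrained well before the Higgs mass.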

With the above inputs, $\sin^2\theta_W$ and $M_W$ can be calculated when values for $m_t$ 
and $M_H$ are given. Such arguments were used to constrain the top quark's mass before 
its discovery. Experimental results from the Large Electron–Positron Collider 
(LEP) combined with loop contributions restricted the mass $m_t$ to within the range 
160–180 GeV where the top quark was discovered. The standard-model prediction 
now is 

$$ m_t(m_t) = 174.3 \pm 3.4\ \text{GeV}. \tag{17.9} $$


Furthermore, the precision electroweak data accumulated at the LEP, SLAC, and 
the Tevatron strongly support the standard model with a weakly coupled Higgs 
boson. As described above, the Higgs boson contributes to the $W^\pm$ and Z vacuum 
polarization through loop effects. The result of a global fit is shown in Fig. 17.2 
and yields the value (Eidelman, 2004) 


My = 12617, GeV, (17.10) 


which is consistent with the lower bound of 114.4GeV established in direct 
searches. These values provide a benchmark for designing experiments aiming 
at the discovery of the Higgs particles. 
Figure 17.2. The result of the global fit presented as $\Delta\chi^2 = \chi^2 - \chi^2_{\min}$ versus $M_H$. 


Before leaving this section, it is worth mentioning two cases in which the results, 
looked at in detail, are either not satisfactory or could indicate 
small disagreements with the predictions of the standard model. A neutrino 
experiment at Fermilab completed a precise measurement of $\sin^2\theta_W$ using the 
R ratio of Eq. (12.17) and other ratios. A 3σ deviation from the standard-model 
prediction was reported. This conclusion depends on several theoretical parameters 
entering the analysis that are now under active investigation (Gluck et al., 
2005). 

A further discrepancy appears in the determination of the effective weak mixing 
angle. The adjective “effective” indicates that a specific renormalization scheme has 
been introduced for the higher-order corrections. The two most precise measurements 
of $\sin^2\theta_{\rm eff}$, from the polarized-electron asymmetry $A_{LR}$ at SLAC and from the 
forward–backward asymmetry $A_{FB}^{b}$ in the production of $b\bar b$ pairs, differ by ~3σ. 
In fact, the result from $A_{LR}$ is in good agreement with the leptonic asymmetries 
measured at the LEP, whereas all other hadronic asymmetries are better compatible 
with the $A_{FB}^{b}$. 

Besides the efforts for discovering the Higgs bosons, it is also important to 
confirm or eliminate these discrepancies. There are theoretical corrections that 
have been pointed out and are being actively discussed. Their implications are 
so profound that one should continue the investigations. The absence of Higgs 
particles in the mass range discussed in this section and/or the persistence of the 
discrepancies will be an indication of new physics. A promising candidate for new 
physics is supersymmetry, whose presence is capable of modifying the predictions 
described in this section. 
17.3 Bounds on masses from general principles 


There are many new reactions that are possible in the electroweak theory. At tree 
level the amplitudes involve a few partial waves and must satisfy unitarity bounds. 
For elastic WW scattering the bounds have been exploited effectively to restrict the 
mass of the Higgs boson. 

At very high energies the longitudinal polarization of the W bosons is $\varepsilon_L^\mu \approx p^\mu/M_W$. 
It produces the fastest growing amplitudes as functions of energy and 
involves one partial wave. The scattering amplitude can be expanded in terms of 
partial waves: 


$$ M(s,t) = \sum_{\ell=0}^{\infty}(2\ell+1)\,a_\ell(s)\,P_\ell(\cos\theta). \tag{17.11} $$


The fact that each partial wave must satisfy a unitarity bound leads to the condition 

$$ |a_\ell| \le \frac{1}{2} \tag{17.12} $$

and supplies an upper bound for the mass of the Higgs boson (Lee et al., 1977), 

$$ M_H^2 < \frac{4\pi\sqrt{2}}{G_F} \approx (1.2\ \text{TeV})^2. \tag{17.13} $$


This is the simplest bound described also in Problem 2. It can be improved (Lee 
et al., 1977) by considering the coupled channels WLWL, ZLZL, ZLH, and HH, 
with the subscript L indicating longitudinal polarization, then demanding that the 
eigenvalues of these coupled channels satisfy the unitarity conditions. The improved 
bound is smaller by a factor of 1//3, i.e. 


My < 700GeV. 
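
The two numbers quoted above can be checked with a few lines of arithmetic. The sketch below evaluates Eq. (17.13) and the coupled-channel improvement; the numerical value of $G_F$ and the function name `unitarity_bound_gev` are assumptions introduced here for illustration.

```python
import math

G_F = 1.1664e-5  # Fermi constant in GeV^-2 (assumed input value)

def unitarity_bound_gev(g_fermi=G_F):
    """Single-channel bound of Eq. (17.13): M_H^2 < 4*pi*sqrt(2)/G_F."""
    return math.sqrt(4.0 * math.pi * math.sqrt(2.0) / g_fermi)

single_channel = unitarity_bound_gev()
# The text quotes an improvement by a factor 1/sqrt(3) from coupled channels.
coupled_channel = single_channel / math.sqrt(3.0)

print(f"single channel : {single_channel:.0f} GeV")
print(f"coupled channel: {coupled_channel:.0f} GeV")
```

With this input the single-channel bound comes out a little above 1.2 TeV and the coupled-channel bound close to 0.7 TeV, in line with the rounded values in the text.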


New bounds are obtained by requiring stability of the vacuum. We saw in
Chapter 5 that the Higgs potential must have the specific form of Fig. 5.1 in order to
be able to expand the field around a minimum of the potential. This property must
be preserved even when radiative corrections are included. The coupling constant
$\lambda$ receives corrections from loop diagrams and it becomes a “running” coupling
depending on the scale $\Lambda$ at which it is evaluated. The change of $\lambda$ is given by a
differential equation that depends on the Yukawa coupling (quark loops) and the
gauge coupling (gauge loops). The corrected potential has the form

$$V_{\rm eff} = -\mu^2 H^2 + \lambda(\Lambda) H^4 \qquad (17.14)$$

and its minimum defines again a vacuum expectation value: $\langle\phi\rangle = v/\sqrt{2}$. If
the corrections are large, it is possible to obtain a negative $\lambda$ and consequently
an unstable ground state. On the other hand, for $\lambda$ large and positive there
is one trivial minimum at the origin. Since $V(0) = 0$, a non-trivial minimum
exists provided that $V(v/\sqrt{2}) < 0$. This condition gives the lower bound (Linde,
1976)

$$M_H \gtrsim 50\ \mathrm{GeV} \quad \text{with} \quad \Lambda = 1\ \mathrm{TeV}.$$


Experimental searches for Higgses have gone much above this value. The best value 
available comes from the reaction in Eq. (17.15) to be discussed later on. 


17.4 Decays 


At the beginning of this chapter we listed several couplings that depend on a single 
parameter $M_H$. A characteristic property in models with one Higgs doublet is that
Higgs particles couple strongly to the heaviest particle. For instance, the couplings to 
fermions prefer the top quark and the couplings to W and Z bosons are proportional 
to their masses. With so many couplings available, it is straightforward to compute 
decay widths and production rates. We list in Table 17.1 decay widths for several 
channels. 

The decays have special properties worth mentioning. The lifetime of the Higgs 
particle is very short and, being neutral, it leaves no visible track. Its decay products, 
such as $b\bar{b}$ or $t\bar{t}$ pairs, produce detectable tracks. They are used as signatures for
discovering the Higgs boson. 

The decays to vector mesons also have several interesting properties. The
formulas for decay widths into the weak vector bosons Z and W look quite similar,
except for an additional factor of 1/2, accounting for the symmetric final state in the
case of two identical Z bosons. In the limit $m_Z, m_W \ll m_H$ the width of the ZZ pairs
is half of the $W^+W^-$ width. The vector bosons have three polarizations and it is
interesting to investigate the decays to final states with specific polarizations. The
longitudinal polarizations are created by the Higgs particles and may show
irregularities. For large Higgs masses the vector bosons in the final state are dominantly
longitudinally polarized, which may be important for distinguishing this process
from the background.
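
The statement that the ZZ width approaches half of the $W^+W^-$ width can be illustrated numerically. The following is a minimal sketch of the tree-level formulas (1) to (3) of Table 17.1; the numerical inputs for $G_F$, $m_W$, $m_Z$ and all function names are assumptions made for this example, and widths are returned as zero below threshold.

```python
import math

G_F = 1.1664e-5          # GeV^-2 (assumed input)
M_W, M_Z = 80.4, 91.19   # GeV (assumed inputs)

def width_fermions(m_h, m_f, n_c):
    """Entry (1) of Table 17.1: H -> f fbar, with color factor n_c."""
    lam = (m_f / m_h) ** 2
    if lam >= 0.25:
        return 0.0
    return (n_c * G_F / (4.0 * math.pi * math.sqrt(2.0))
            * m_f ** 2 * m_h * (1.0 - 4.0 * lam) ** 1.5)

def _width_vector_pair(m_h, m_v, denominator):
    """Common form of entries (2) and (3); denominator is 16 for ZZ, 8 for WW."""
    lam = (m_v / m_h) ** 2
    if lam >= 0.25:
        return 0.0
    return (G_F * m_h ** 3 / (denominator * math.pi * math.sqrt(2.0))
            * math.sqrt(1.0 - 4.0 * lam)
            * (12.0 * lam ** 2 - 4.0 * lam + 1.0))

def width_zz(m_h):
    return _width_vector_pair(m_h, M_Z, 16.0)

def width_ww(m_h):
    return _width_vector_pair(m_h, M_W, 8.0)

for m_h in (200.0, 500.0, 1000.0, 5000.0):
    ratio = width_zz(m_h) / width_ww(m_h)
    print(f"m_H = {m_h:6.0f} GeV  Gamma(ZZ)/Gamma(WW) = {ratio:.3f}")
```

Close to threshold the ratio is well below 1/2 because of the different phase-space factors; it approaches 1/2 only when both boson masses are negligible compared with $m_H$.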

The decays into fermions or weak vector bosons proceed through tree diagrams,
in contrast to the decays in groups (4) and (5), which contain loop contributions.
For loop contributions one expects the branching ratios to be smaller, which is
indeed the case. However, for $M_H < 200$ GeV they are still measurable because the
sum over all possible particles inside the loop brings an enhancement. The gluonic
decays are drowned in a huge background, in contrast to photonic decays with two
stable particles, which provide a convenient way for identifying the Higgs. Both
photonic and gluonic modes proceed through loop diagrams; some of them are
shown in Fig. 17.3 (Gunion et al., 1990; Hinchliffe, 1998).


Table 17.1. Partial decay widths $\Gamma$ of the Higgs boson H, with $\lambda_i = m_i^2/m_H^2$

(1) Fermions f, $H \to f\bar{f}$:

$$\Gamma = N_c\,\frac{G_F}{4\pi\sqrt{2}}\, m_f^2\, m_H\, (1 - 4\lambda_f)^{3/2}$$

$N_c$ is a color factor; $N_c = 1$ for leptons and $N_c = 3$ for quarks.

(2) Weak neutral bosons $Z^0$, $H \to Z^0Z^0$:

$$\Gamma = \frac{G_F\, m_H^3}{16\pi\sqrt{2}}\,(1 - 4\lambda_Z)^{1/2}\,(12\lambda_Z^2 - 4\lambda_Z + 1)$$

(3) Weak charged bosons $W^\pm$, $H \to W^+W^-$:

$$\Gamma = \frac{G_F\, m_H^3}{8\pi\sqrt{2}}\,(1 - 4\lambda_W)^{1/2}\,(12\lambda_W^2 - 4\lambda_W + 1)$$

(4) Gluons g, $H \to gg$ (via a loop containing quarks q):

$$\Gamma = \frac{G_F\,\alpha_s^2(m_H^2)}{36\pi^3\sqrt{2}}\, m_H^3\,\Big|\sum_q I_q\Big|^2$$

with $I_q = 3[2\lambda_q + \lambda_q(4\lambda_q - 1) f(\lambda_q)]$ and

$$f(\lambda) = \begin{cases} -2\left[\sin^{-1}\!\left(1/\sqrt{4\lambda}\right)\right]^2, & \lambda > 1/4, \\ \tfrac{1}{2}\left[\ln(\eta^+/\eta^-) - i\pi\right]^2, & \lambda < 1/4, \end{cases} \qquad \eta^\pm = 1/2 \pm \sqrt{1/4 - \lambda}$$

(5) Photons $\gamma$, $H \to \gamma\gamma$:

$$\Gamma = \frac{G_F\,\alpha^2}{8\pi^3\sqrt{2}}\, m_H^3\, |I|^2$$

with $I = \sum_q Q_q^2 I_q + \sum_l Q_l^2 I_l + I_W\ (+ \ldots)$, where $Q_i$ is the charge of particle i and

$$I_q = 3[2\lambda_q + \lambda_q(4\lambda_q - 1) f(\lambda_q)]$$

$$I_l = 2\lambda_l + \lambda_l(4\lambda_l - 1) f(\lambda_l)$$

$$I_W = 3\lambda_W(1 - 2\lambda_W) f(\lambda_W) - 3\lambda_W - 1/2$$
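
The loop functions of Table 17.1 are easy to mistype, so a short numerical sketch may help. It implements $f(\lambda)$, $I_q$, $I_l$, and $I_W$ as written in the table and checks the heavy-particle limits $I_q \to 1$ and $I_W \to -7/4$, which follow from expanding the arcsine for large $\lambda$. The function names are assumptions introduced for this example.

```python
import math

def f_loop(lam):
    """Loop function f(lambda) of Table 17.1, returned as a complex number."""
    if lam > 0.25:
        return complex(-2.0 * math.asin(1.0 / math.sqrt(4.0 * lam)) ** 2)
    root = math.sqrt(0.25 - lam)
    eta_plus, eta_minus = 0.5 + root, 0.5 - root
    return 0.5 * (math.log(eta_plus / eta_minus) - 1j * math.pi) ** 2

def i_quark(lam):
    """Quark-loop amplitude I_q (includes the color factor 3)."""
    return 3.0 * (2.0 * lam + lam * (4.0 * lam - 1.0) * f_loop(lam))

def i_lepton(lam):
    """Lepton-loop amplitude I_l."""
    return 2.0 * lam + lam * (4.0 * lam - 1.0) * f_loop(lam)

def i_w(lam):
    """W-loop amplitude I_W."""
    return 3.0 * lam * (1.0 - 2.0 * lam) * f_loop(lam) - 3.0 * lam - 0.5

# Heavy-particle limit (lambda >> 1): I_q -> 1 and I_W -> -7/4.
print("I_q(100) =", i_quark(100.0))
print("I_W(100) =", i_w(100.0))
# Below threshold (lambda < 1/4) the amplitudes acquire an imaginary part.
print("f(0.1)   =", f_loop(0.1))
```

The imaginary part for $\lambda < 1/4$ reflects the fact that the particle in the loop can then be produced on shell.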

The relative importance of various decays depends on the mass of the Higgs
particles. We present in Table 17.2 dominant decays to various modes as a function
of the Higgs mass. In the low-mass range the decay to $b\bar{b}$ pairs dominates and
is identified by its decay products. The decay to $\gamma\gamma$ is less frequent (~$10^{-3}$) but
has a smaller background. For higher masses, decays to gauge bosons dominate.
Some decay modes may contribute even below the threshold for pair production of
gauge bosons because some of them are present as virtual intermediate states. We
denote them with a star as superscript; for instance $W^*$, which subsequently decays
to $e\nu_e$.


Table 17.2. Higgs decay modes as a function of its mass

$m_H$ (GeV)    Decay

90-120         $H \to b\bar{b},\ \gamma\gamma$

120-140        $H \to b\bar{b},\ WW^*,\ \tau^+\tau^-$

140-180        $H \to b\bar{b},\ WW^*,\ ZZ^*,\ \tau^+\tau^-$

180-380        $H \to WW,\ ZZ$

>380           $H \to WW,\ ZZ,\ t\bar{t}$


Figure 17.3. Some triangle diagrams for decay of a Higgs boson to photons.

The calculation of decays proceeding through tree diagrams involves a matrix 
element and a two-body phase space, which are straightforward to calculate. We 
suggest some of them as exercises at the end of the chapter.


The discovery of the Higgs particle is of paramount importance. For this reason there
have been large experiments designed to discover them. The early expectations of a
light Higgs have been ruled out experimentally. A light Higgs appears in the decay 
of the known particles, like mesons, charmonium, and the gauge bosons. Intensive 
searches in these decays did not produce any evidence for Higgs particles. The next 
possibility is to produce them in electron—positron colliders through the process 
shown in Fig. 17.4: 


$$e^+e^- \to V^* \to V + H \quad (V = W, Z), \qquad V \to \text{leptons}, \qquad (17.15)$$

Figure 17.4. Bremsstrahlung from weak bosons.


which is known as bremsstrahlung from the weak gauge bosons. For lower energies
($\sqrt{s} \sim 200$-$500$ GeV) bremsstrahlung dominates and has been searched for at the
LEP collider at CERN. The cross section is maximal at center-of-mass energies
close to the Z resonance, where it is given by (Gunion et al., 1990)


$$\sigma(e^+e^- \to ZH) = \frac{G_F^2 M_Z^4}{96\pi}\left[1 + (1 - 4\sin^2\theta_W)^2\right]\frac{8k}{\sqrt{s}}\,\frac{k^2 + 3M_Z^2}{(s - M_Z^2)^2 + M_Z^2\Gamma_Z^2}. \qquad (17.16)$$

Here k is the center-of-mass momentum of the Z boson produced. The detection of
this channel observes the outgoing Z and reconstructs the Higgs boson’s invariant
mass as

$$M_H^2 = s - 2\sqrt{s}\,E_Z + M_Z^2. \qquad (17.17)$$

The searches at CERN established the bound

$$M_H > 114.4\ \mathrm{GeV}.$$
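
To get a feeling for the size of the cross section in Eq. (17.16) and for the recoil-mass reconstruction in Eq. (17.17), the following sketch evaluates both. All numerical inputs ($G_F$, $M_Z$, $\Gamma_Z$, $\sin^2\theta_W$, the conversion factor to picobarns) and the function names are assumptions chosen for illustration; the momentum k is obtained from two-body kinematics.

```python
import math

G_F = 1.1664e-5        # GeV^-2 (assumed input)
M_Z = 91.19            # GeV (assumed input)
GAMMA_Z = 2.495        # GeV (assumed input)
SIN2_THETA_W = 0.231   # assumed input
GEV2_TO_PB = 0.3894e9  # 1 GeV^-2 expressed in picobarns

def z_momentum(sqrt_s, m_h):
    """Center-of-mass momentum k of the Z in e+e- -> ZH (zero below threshold)."""
    s = sqrt_s ** 2
    kallen = (s - (M_Z + m_h) ** 2) * (s - (M_Z - m_h) ** 2)
    return math.sqrt(kallen) / (2.0 * sqrt_s) if kallen > 0.0 else 0.0

def sigma_zh_pb(sqrt_s, m_h):
    """Cross section of Eq. (17.16), converted to picobarns."""
    s = sqrt_s ** 2
    k = z_momentum(sqrt_s, m_h)
    couplings = 1.0 + (1.0 - 4.0 * SIN2_THETA_W) ** 2
    propagator = (s - M_Z ** 2) ** 2 + (M_Z * GAMMA_Z) ** 2
    sigma = (G_F ** 2 * M_Z ** 4 / (96.0 * math.pi) * couplings
             * 8.0 * k / sqrt_s * (k ** 2 + 3.0 * M_Z ** 2) / propagator)
    return sigma * GEV2_TO_PB

def recoil_mass(sqrt_s, e_z):
    """Higgs mass reconstructed from the Z energy, Eq. (17.17)."""
    m2 = sqrt_s ** 2 - 2.0 * sqrt_s * e_z + M_Z ** 2
    return math.sqrt(m2) if m2 > 0.0 else 0.0

sqrt_s, m_h = 206.0, 110.0
k = z_momentum(sqrt_s, m_h)
e_z = math.hypot(k, M_Z)  # energy of the on-shell Z
print(f"k = {k:.1f} GeV, sigma = {sigma_zh_pb(sqrt_s, m_h):.2f} pb")
print(f"recoil mass = {recoil_mass(sqrt_s, e_z):.1f} GeV")
```

For an on-shell Z the recoil mass returns the input Higgs mass exactly, which is why the method does not depend on the Higgs decay mode.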


A similar process occurs in hadron colliders, where the initial leptons are replaced 
by quarks to give 


$$q\bar{q} \to V^* \to V + H.$$


For center-of-mass energies higher than 500 GeV, the fusion process

$$e^+e^- \to \nu\bar{\nu} + H,$$

shown in Fig. 17.5, begins to dominate. For high energies the initial partons or
leptons radiate not only photons but also W and Z gauge bosons. Here one must be
careful to include all possible diagrams in order to satisfy gauge invariance. The
calculation follows a method very similar to that for the emission of photons, in the
Weizsäcker-Williams approximation, with special attention paid to the longitudinal
and transverse polarization (Gunion et al., 1990). In this way one derives an effective
W-emission approximation, which leads to the cross section (Wilczek, 1977; Cahn
and Dawson, 1984; Jones and Petkov, 1979)

$$\sigma(e^+e^- \to \nu\bar{\nu}H) \approx \frac{G_F^3 M_W^4}{4\sqrt{2}\,\pi^3}\left[\ln\!\left(\frac{s}{M_H^2}\right) - 2\right]. \qquad (17.18)$$

Figure 17.5. W-W fusion.

This reaction is attractive because it has a low background. It starts smaller than
the bremsstrahlung reaction in Eq. (17.15), but grows with increasing energy to
become dominant at $\sqrt{s} \sim 500$ GeV.
Finally, it is possible to create the Higgs particle in photon fusion: 


$$\gamma\gamma \to H \to b\bar{b}.$$


For low-energy photons this process is very small. However, for intense high- 
energy laser beams it has a resonance structure and may become important. For 
Higgs masses larger than 150 GeV the dominant final states are two gauge bosons. 


17.6 Production in hadron colliders 


The high-energy electron—positron collider (LEP) completed its runs and set the 
bound of 114.4 GeV. The next searches will be taking place in hadron colliders. 
Searches are already taking place at the Tevatron and a large hadron collider (LHC) 
is under construction at CERN. Several of the reactions in the previous section 
become hadronic reactions once we replace the initial leptons by quarks or quark— 
antiquark pairs. 

In hadronic collisions the process of gluon fusion is the most important source 
for Higgs-particle production. In this process the Higgs couples via a quark triangle 
diagram to the gluons. The point-like cross section at parton level is (Kniehl, 1994) 


$$\sigma(gg \to H) = \frac{\pi^2}{8}\,\frac{\Gamma(H \to gg)}{M_H}\,\delta(\hat{s} - M_H^2). \qquad (17.19)$$


For the total hadronic cross section it is very important to take higher-order gluonic
corrections into account, because their contribution is positive and increases the
cross section. The cross section decreases with increasing Higgs mass, although
the Yukawa coupling grows with the loop-quark mass.

At the Tevatron, a $p\bar{p}$ collider at Fermilab, the two efficient mechanisms are
gluon fusion and associated production with a W or a Z:

$$gg \to H \to b\bar{b}, \qquad (17.20)$$

$$q\bar{q} \to W^*/Z^* \to W/Z + H. \qquad (17.21)$$

Although the gg process has the larger cross section, ~1 pb (one picobarn) at
$M_H = 115$ GeV, it is hard to detect it for $M_H < 130$ GeV. In this mass range the
dominant decay is $H \to b\bar{b}$ (see Fig. 1 in Hinchliffe (1998)), which is swamped
by a multijet background. For these masses only the production of a Higgs in
association with a vector boson has enough sensitivity. The WH and ZH channels
give a clear signal with lepton(s), missing neutrino(s) and two b-jets. The searches
continue at the Tevatron, where there are plans to reach an integrated luminosity of
~8 fb$^{-1}$.

The Tevatron has the potential of discovering the standard-model Higgs with
masses less than ~170 GeV. Heavier Higgses must wait for the LHC, which will
reach a center-of-mass energy of 14 TeV. It will run for an integrated luminosity of
300 fb$^{-1}$ and can discover Higgs particles with masses up to 1 TeV = $10^3$ GeV. At
these high energies, in addition to the gluon-gluon fusion, the quarks radiate gauge
bosons. A new process of W-W fusion into Higgs, analogous to the one shown in
Fig. 17.5,

$$W + W \to H \to \gamma\gamma,\ ZZ^*, \ldots \qquad (17.22)$$

becomes significant. If the standard-model Higgs has a mass above twice the
Z mass, the discovery will be through the channel

$$pp \to H + \text{hadrons} \to ZZ + \cdots \to (l^+l^-)(l^+l^-) + \cdots. \qquad (17.23)$$

This is called the golden channel for Higgs production and decays. Both lepton
pairs will have the mass of the Z boson, making possible the reduction of many
backgrounds.

If the mass of the Higgs boson is much bigger than the mass of the W or Z boson,
$M_H > 2M_W$, the width is

$$\Gamma(H \to WW) \approx \frac{3G_F}{16\pi\sqrt{2}}\,M_H^3 \approx 0.48\ \mathrm{TeV}\left(\frac{M_H}{1\ \mathrm{TeV}}\right)^3. \qquad (17.24)$$

This width grows rapidly with the mass of the Higgs particle and is very broad for
large masses. The increased width of the Higgs and the reduced production rate
determine the upper limit for detecting a heavy Higgs.
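
The cubic growth in Eq. (17.24) is easy to tabulate. The sketch below evaluates the coefficient $3G_F/(16\pi\sqrt{2})$, which is also what one obtains by adding the WW and ZZ widths of Table 17.1 with the boson masses neglected; the value of $G_F$ and the function name are assumptions for this example.

```python
import math

G_F = 1.1664e-5  # GeV^-2 (assumed input)

def heavy_higgs_width_gev(m_h):
    """Width of Eq. (17.24), 3 G_F M_H^3 / (16 pi sqrt(2)), in GeV."""
    return 3.0 * G_F * m_h ** 3 / (16.0 * math.pi * math.sqrt(2.0))

for m_h in (300.0, 600.0, 1000.0, 1400.0):
    width = heavy_higgs_width_gev(m_h)
    print(f"M_H = {m_h:6.0f} GeV  width = {width:7.1f} GeV  width/M_H = {width / m_h:.2f}")
```

At 1 TeV the width is roughly half of the mass, close to the 0.48 TeV quoted in the text, and near 1.4 TeV the width becomes comparable to the mass itself, so the notion of a resonance peak loses its meaning.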


17.7 Other symmetry-breaking schemes 


There is so much supporting evidence for a Higgs particle with a mass lower than 
1 TeV that the chances of discovering it at the LHC are very high. If the Higgs 
is not of the simple form discussed in this chapter, there are strong arguments 
from unitarity and from higher-order corrections that another mechanism must be 
operating to reduce the fast growth of amplitudes and cross sections. Thus either 
the Higgs will show up as the simple scalar of the standard model, or a more 
complicated structure must appear. 

Extensions of the standard model can have more complicated spectra of Higgs
bosons. A popular extension is supersymmetry with two Higgs doublets whose
neutral components have two vacuum expectation values. The physical particles
are now a charged boson ($H^\pm$), two neutral scalar Higgses ($H_1^0$ and $H_2^0$), and one
pseudoscalar ($A$) (Hinchliffe, 1998).

Charged Higgs bosons can be pair-produced in $e^+e^-$ and $q\bar{q}$ annihilation. The
chance of detecting them depends on the energy of the collider and the branching
ratios to $\nu\tau$, $c\bar{s}$, and $c\bar{b}$. Searches at the LEP did not discover them and thereby set
lower limits on their masses.

In the simplest version of the supersymmetric model the mass of the lightest
neutral scalar depends on the top quark’s mass, the ratio of the two vacuum expectation
values, and the masses of other supersymmetric particles. For $m_{\rm top} = 175$ GeV,
there is a bound $M_H < 130$ GeV, provided that the ratio of the vacuum
expectation values is large. This mass is within the range of the Tevatron and LHC, as we
discussed earlier.

We mention a final possibility, whereby the Higgs particle is a bound state of
quark-antiquark pairs. In such theories a new strong interaction must be present
to bind the quark pair together. A dynamical symmetry breaking has been
formulated with a top-quark condensate (Bardeen et al., 1996), in analogy with the BCS
theory of superconductivity. The low-energy effective theory is the standard model
supplemented with relationships connecting masses of the top quark, W boson, and
Higgs boson.

Another extension introduces new quarks (techniquarks) coupled to W and Z
bosons and bound together by new gluons, the technigluons. There is a new
coupling constant, which runs to become strong at a scale ~1 TeV. The theory is a
scaled-up version of QCD with heavy fermions U and D imitating the light up and
down quarks. The strong interactions at 1 TeV cause a spontaneous breaking of the
$SU(2)_L \times SU(2)_R$ symmetry of the new quarks, producing heavy color scalars, the
new Goldstone bosons (Farhi and Susskind, 1979). Searches for the new scalars
follow the methods we described in this and the previous section.


Problems for Chapter 17 


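1. Show that the decay rates to transverse (±) and longitudinally (L) polarized W bosons
are

$$\Gamma(H \to W_+W_+) = \Gamma(H \to W_-W_-) = \frac{g^2}{16\pi}\,\frac{M_W^2}{M_H}\left(1 - \frac{4M_W^2}{M_H^2}\right)^{1/2},$$

$$\Gamma(H \to W_LW_L) = \frac{g^2}{64\pi}\,\frac{M_H^3}{M_W^2}\left(1 - \frac{2M_W^2}{M_H^2}\right)^2\left(1 - \frac{4M_W^2}{M_H^2}\right)^{1/2},$$

$$\Gamma(H \to W_+W_-) = 0.$$

Hence, for $M_H \gg M_W$, the W bosons from Higgs decay are dominantly longitudinally
polarized.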
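
A numerical cross-check of the expressions in Problem 1 (not a derivation) is sketched below. It assumes the tree-level relation $g^2 = 4\sqrt{2}\,G_F M_W^2$ and compares the sum over polarizations with entry (3) of Table 17.1; the numerical inputs and function names are assumptions for this example.

```python
import math

G_F = 1.1664e-5  # GeV^-2 (assumed input)
M_W = 80.4       # GeV (assumed input)
G2 = 4.0 * math.sqrt(2.0) * G_F * M_W ** 2  # g^2 from G_F/sqrt(2) = g^2/(8 M_W^2)

def polarized_widths(m_h):
    """Return (width per transverse mode, longitudinal width) in GeV."""
    x = (M_W / m_h) ** 2
    if 4.0 * x >= 1.0:
        return 0.0, 0.0
    beta = math.sqrt(1.0 - 4.0 * x)
    transverse = G2 * M_W ** 2 / (16.0 * math.pi * m_h) * beta
    longitudinal = (G2 * m_h ** 3 / (64.0 * math.pi * M_W ** 2)
                    * (1.0 - 2.0 * x) ** 2 * beta)
    return transverse, longitudinal

def table_width_ww(m_h):
    """Unpolarized H -> W+W- width, entry (3) of Table 17.1."""
    lam = (M_W / m_h) ** 2
    return (G_F * m_h ** 3 / (8.0 * math.pi * math.sqrt(2.0))
            * math.sqrt(1.0 - 4.0 * lam) * (12.0 * lam ** 2 - 4.0 * lam + 1.0))

for m_h in (200.0, 400.0, 1000.0):
    t, l = polarized_widths(m_h)
    total = 2.0 * t + l  # two transverse modes (++ and --) plus LL
    print(f"M_H = {m_h:5.0f} GeV  longitudinal fraction = {l / total:.3f}  "
          f"sum/table = {total / table_width_ww(m_h):.6f}")
```

The longitudinal fraction rises from roughly two thirds at 200 GeV to essentially unity at 1 TeV, which is the content of the last sentence of the problem.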

2. Show that the amplitude for $W_L^+W_L^- \to W_L^+W_L^-$ in the limit where $s, M_H^2 \gg M_W^2, M_Z^2$
is

$$M(W_L^+W_L^- \to W_L^+W_L^-) = -\sqrt{2}\,G_F M_H^2\left(\frac{s}{s - M_H^2} + \frac{t}{t - M_H^2}\right)$$

with s and t the Mandelstam variables. If $M_H \to \infty$, the amplitude grows linearly in s.
Calculate now the $l = 0$ partial-wave contribution

$$a_0 = \frac{1}{16\pi s}\int_{-s}^{0} dt\, M(W_L^+W_L^- \to W_L^+W_L^-)$$

and show that, for $s \gg M_H^2$, the result for $a_0$ is

$$a_0 = -\frac{G_F M_H^2}{8\pi\sqrt{2}}.$$

This result together with the unitarity condition (17.12) gives Eq. (17.13).

3. A simple estimate of the process of gluon-gluon fusion is obtained by calculating the
Drell-Yan process. Consider the reaction $gg \to H \to \gamma\gamma$ with the mass $M_H$ varying,
i.e. the Higgs is off the mass shell. The point cross section is given in Eq. (17.19) and
the decay width is listed in Table 17.1.

(i) Identify $Q^2$ in Section 10.5 with $M_H^2$ and write the Drell-Yan cross section. For the
gluon distribution function adopt the simplified form

$$f_g(x) = \frac{1}{x}\,8(1 - x)^7.$$

Compute the production cross section in proton-proton collisions.

(ii) Compute the cross section numerically for a Higgs of 130 GeV for pp collisions of
center-of-mass energy 5-20 TeV. To simplify the calculation, you may treat $I(\lambda)$
with $\lambda = m_q^2/m_H^2$ as constant.


Epilogue


In this book, the reader has had the opportunity to follow the development of the 
electroweak theory and the discovery of several new phenomena predicted by the 
theory. We have analyzed several of them in various chapters. The richness of the 
field and its high level of accuracy have been achieved with the help of several very 
large, very accurate, and refined experiments. 

The potential for discovery has not yet been exhausted because the theory must
be completed with the discovery of the Higgs particle(s) or some other
symmetry-breaking scheme. Experiments at the LHC will either discover the Higgs boson
or find new interactions indicating another mechanism for the breaking of
symmetry. There is another observation in addition to symmetry-breaking that demands
extension of the theory: the mixing and mass differences of neutrinos. Most of our
colleagues speculate about a larger theory. Grand unified theories will unify the
electroweak with the strong interactions, at some high energy bringing the three
coupling constants together. The main issue here is to find predictions unique to
the grand unified theory that will be verified by experiment. The symmetry of the
new theory and the particle classification within the group remain open issues.

An alternative theory is supersymmetry, with many more particles.
Supersymmetry is a symmetry relating fermions to bosons and must be broken. There are
many different symmetry-breaking schemes. Predictions of the minimal
supersymmetric theory will be tested at the LHC.

A parallel development has been the association of weak and electromagnetic
interactions with astronomical and cosmological phenomena. For instance, particle
interactions determine the spectrum of the cosmic background radiation.
Furthermore, with small extensions, the present theory explains the generation of baryons.
One attractive scenario considers the generation of a lepton asymmetry converted
into a baryon asymmetry at the energy scale of the electroweak phase transition
(v = 246 GeV), generating an excess of matter. The interplay between gravity
and particle interactions is responsible for the formation of large structures. Such
developments have brought about a closer cooperation between particle physicists
and cosmologists.

I was tempted to include possible new developments that would lead to diverse 
predictions. This is a difficult task, where past experience indicates that Nature 
frequently selects solutions beyond our imagination. I resisted this temptation, and 
decided to stop here with an open-ended conclusion that invites future thought. 


Appendix A 


Conventions, spinors, and currents 


A.1 Conventions 


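The space-time coordinates $(t, x, y, z) = (t, \mathbf{x})$ are denoted by a contravariant four-vector
($c$ and $\hbar$ are set equal to 1):

$$x^\mu = (x^0, x^1, x^2, x^3) = (t, x, y, z). \qquad (A.1)$$

The metric tensor is

$$g_{\mu\nu} = g^{\mu\nu} = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{pmatrix}, \qquad (A.2)$$

$$p^\mu = (p_0, \mathbf{p}), \qquad p_\mu = g_{\mu\nu}p^\nu = (p_0, -\mathbf{p}). \qquad (A.3)$$

Momentum four-vectors are similarly defined,

$$p^\mu = (E, p_x, p_y, p_z), \qquad (A.4)$$

and the inner product

$$p_1 \cdot p_2 = p_{1\mu}\,p_2^\mu = E_1E_2 - \mathbf{p}_1\cdot\mathbf{p}_2. \qquad (A.5)$$

We frequently meet products of the totally antisymmetric tensor $\epsilon_{\alpha\beta\gamma\delta}$ (note $\epsilon_{0123} = -\epsilon^{0123}$):

$$\epsilon_{\alpha\beta\gamma\mu}\,\epsilon^{\alpha\beta\gamma\nu} = -6\,\delta_\mu^\nu, \qquad (A.6)$$

$$\epsilon_{\alpha\beta\mu\nu}\,\epsilon^{\alpha\beta\rho\sigma} = -2\begin{vmatrix} \delta_\mu^\rho & \delta_\nu^\rho \\ \delta_\mu^\sigma & \delta_\nu^\sigma \end{vmatrix}, \qquad (A.7)$$

$$\epsilon_{\alpha\mu\nu\rho}\,\epsilon^{\alpha\beta\gamma\delta} = -\begin{vmatrix} \delta_\mu^\beta & \delta_\nu^\beta & \delta_\rho^\beta \\ \delta_\mu^\gamma & \delta_\nu^\gamma & \delta_\rho^\gamma \\ \delta_\mu^\delta & \delta_\nu^\delta & \delta_\rho^\delta \end{vmatrix}. \qquad (A.8)$$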

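A.2 Dirac matrices and spinors


Anticommutation of $\gamma$-matrices:

$$\{\gamma^\mu, \gamma^\nu\} = \gamma^\mu\gamma^\nu + \gamma^\nu\gamma^\mu = 2g^{\mu\nu}, \qquad (A.9)$$

$$\gamma_5 = i\gamma^0\gamma^1\gamma^2\gamma^3, \qquad \{\gamma^\mu, \gamma_5\} = 0. \qquad (A.10)$$

The $\sigma$-matrix:

$$\sigma^{\mu\nu} = \frac{i}{2}[\gamma^\mu, \gamma^\nu]. \qquad (A.11)$$

Reduction of the product of three $\gamma$-matrices:

$$\gamma^\mu\gamma^\nu\gamma^\lambda = S^{\mu\nu\lambda\rho}\gamma_\rho + i\epsilon^{\mu\nu\lambda\rho}\gamma_\rho\gamma_5, \qquad (A.12)$$

with

$$S^{\mu\nu\lambda\rho} = g^{\mu\nu}g^{\lambda\rho} + g^{\nu\lambda}g^{\mu\rho} - g^{\mu\lambda}g^{\nu\rho}. \qquad (A.13)$$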

A familiar representation of y -matrices is 


where 


is the 2 x 2 unit matrix. 
The spinors u and v satisfy the Dirac equation,

$$(\not{p} - m)\,u(p, s) = 0, \qquad (A.17)$$

$$(\not{p} + m)\,v(p, s) = 0. \qquad (A.18)$$

The normalization of spinors is

$$\bar{u}(p, s)\,u(p, s) = 2m, \qquad (A.19)$$

$$\bar{v}(p, s)\,v(p, s) = -2m, \qquad (A.20)$$

and the completeness relation is

$$\sum_s u(p, s)\,\bar{u}(p, s) = \not{p} + m, \qquad (A.21)$$

$$\sum_s v(p, s)\,\bar{v}(p, s) = \not{p} - m. \qquad (A.22)$$


A.3 Currents


Vector:

$$J_\mu(x) = \bar{\psi}(x)\gamma_\mu\psi(x) = \psi^\dagger(x)\gamma_0\gamma_\mu\psi(x). \qquad (A.23)$$

Axial:

$$J_{\mu 5}(x) = \bar{\psi}(x)\gamma_\mu\gamma_5\psi(x). \qquad (A.24)$$

Decompositions of the currents or products of them are very useful. Let $\ell_\mu = p_\mu + p'_\mu$
and $q_\mu = p'_\mu - p_\mu$, then

$$\bar{u}(p')\gamma^\mu u(p) = \frac{1}{2m}\,\bar{u}(p')\left(\ell^\mu + i\sigma^{\mu\nu}q_\nu\right)u(p), \qquad (A.25)$$

$$\bar{u}(p')\gamma^\mu\gamma_5 u(p) = \frac{1}{2m}\,\bar{u}(p')\left(\gamma_5 q^\mu + i\gamma_5\sigma^{\mu\nu}\ell_\nu\right)u(p), \qquad (A.26)$$

$$\bar{u}(p')\,i\sigma^{\mu\nu}\ell_\nu\,u(p) = -\bar{u}(p')\,q^\mu\,u(p), \qquad (A.27)$$

$$\bar{u}(p')\,i\sigma^{\mu\nu}q_\nu\,u(p) = \bar{u}(p')\left(2m\gamma^\mu - \ell^\mu\right)u(p). \qquad (A.28)$$


Additional identities can be found in Appendix A of the article by M. Nowakowski, E. 
Paschos and J. M. Rodriguez (Eur. J. Phys. 26, 545-560, 2005) and in Appendix C of this 
book. 


Appendix B 


Cross sections and traces 


B.1 Cross sections 


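The matrix element M appearing in cross sections and decay rates is Lorentz-invariant
and dimensionless. The expression for the cross section is

$$d\sigma = \frac{1}{|\mathbf{v}_1 - \mathbf{v}_2|}\,\frac{1}{2E_{p_1}}\,\frac{1}{2E_{p_2}}\,|M|^2\,(2\pi)^4\,\delta^4\!\Big(p_1 + p_2 - \sum_{i=1}^{n}k_i\Big)\,\frac{d^3k_1}{2E_1(2\pi)^3}\cdots\frac{d^3k_n}{2E_n(2\pi)^3}\,S. \qquad (B.1)$$

The flux factor is frequently computed in the laboratory frame or the center-of-mass
frame. In general,

$$\frac{1}{|\mathbf{v}_1 - \mathbf{v}_2|} = \frac{E_{p_1}E_{p_2}}{\left[(p_1\cdot p_2)^2 - m^2M^2\right]^{1/2}}. \qquad (B.2)$$

The factor S is

$$S = \prod_i \frac{1}{k_i!} \qquad (B.3)$$

if there are $k_i$ identical particles of species i in the final state.
The decay width for a particle moving with energy E is

$$d\Gamma = \frac{1}{2E}\,|M|^2\,(2\pi)^4\,\delta^4\!\Big(p - \sum_{i=1}^{n}k_i\Big)\,\frac{d^3k_1}{2E_1(2\pi)^3}\cdots\frac{d^3k_n}{2E_n(2\pi)^3}\,S. \qquad (B.4)$$


B.2 Contraction identities and traces


$$\not{a}\,\not{b} = 2a\cdot b - \not{b}\,\not{a}, \qquad (B.5)$$

$$\gamma^\mu\gamma_\mu = 4, \qquad (B.6)$$

$$\gamma^\mu\gamma^\nu\gamma_\mu = -2\gamma^\nu, \qquad (B.7)$$

$$\gamma^\mu\gamma^\nu\gamma^\rho\gamma_\mu = 4g^{\nu\rho}, \qquad (B.8)$$

$$\gamma^\mu\gamma^\nu\gamma^\rho\gamma^\sigma\gamma_\mu = -2\gamma^\sigma\gamma^\rho\gamma^\nu, \qquad (B.9)$$

$$\gamma^\mu\gamma^\nu\gamma^\rho\gamma^\sigma\gamma^\tau\gamma_\mu = 2\left(\gamma^\tau\gamma^\nu\gamma^\rho\gamma^\sigma + \gamma^\sigma\gamma^\rho\gamma^\nu\gamma^\tau\right), \qquad (B.10)$$

$$\gamma^\mu\sigma^{\nu\rho}\gamma_\mu = 0, \qquad (B.11)$$

$$\gamma^\mu\sigma^{\nu\rho}\gamma^\sigma\gamma_\mu = 2\gamma^\sigma\sigma^{\nu\rho}. \qquad (B.12)$$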
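
The contraction identities (B.6) to (B.9) hold in any representation, so they can be checked with explicit matrices. The sketch below uses the Dirac representation, which is an assumption made for this example, and plain nested lists so that no external library is needed; all helper names are introduced here for illustration.

```python
METRIC = (1, -1, -1, -1)

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def scale(c, a):
    return [[c * x for x in row] for row in a]

def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

def block(tl, tr, bl, br):
    """Assemble a 4x4 matrix from four 2x2 blocks."""
    return [tl[0] + tr[0], tl[1] + tr[1], bl[0] + br[0], bl[1] + br[1]]

ONE2, ZERO2 = [[1, 0], [0, 1]], [[0, 0], [0, 0]]
PAULI = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]

# Dirac representation (assumed): gamma^0 = diag(1, -1), gamma^i = [[0, s], [-s, 0]].
G_UP = [block(ONE2, ZERO2, ZERO2, scale(-1, ONE2))]
G_UP += [block(ZERO2, s, scale(-1, s), ZERO2) for s in PAULI]
G_DOWN = [scale(METRIC[mu], G_UP[mu]) for mu in range(4)]
ONE4 = [[1 if i == j else 0 for j in range(4)] for i in range(4)]
ZERO4 = [[0] * 4 for _ in range(4)]

def chain(*mats):
    out = ONE4
    for m in mats:
        out = matmul(out, m)
    return out

def sandwich(*inner):
    """Sum over mu of gamma^mu (inner product) gamma_mu."""
    total = ZERO4
    for mu in range(4):
        total = add(total, chain(G_UP[mu], *inner, G_DOWN[mu]))
    return total

def g_up(n, r):
    return METRIC[n] if n == r else 0

R = range(4)
checks = {
    "B.6": close(sandwich(), scale(4, ONE4)),
    "B.7": all(close(sandwich(G_UP[n]), scale(-2, G_UP[n])) for n in R),
    "B.8": all(close(sandwich(G_UP[n], G_UP[r]), scale(4 * g_up(n, r), ONE4))
               for n in R for r in R),
    "B.9": all(close(sandwich(G_UP[n], G_UP[r], G_UP[s]),
                     scale(-2, chain(G_UP[s], G_UP[r], G_UP[n])))
               for n in R for r in R for s in R),
}
print(checks)
```

The same `sandwich` helper can be reused for the longer identities by passing more matrices.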
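The trace of an odd product of $\gamma^\mu$-matrices vanishes:

$$\mathrm{Tr}(\gamma_5) = 0, \qquad (B.13)$$

$$\mathrm{Tr}(\gamma^\mu\gamma^\nu) = 4g^{\mu\nu}, \qquad (B.14)$$

$$\mathrm{Tr}(\sigma^{\mu\nu}) = 0, \qquad (B.15)$$

$$\mathrm{Tr}(\gamma^\mu\gamma^\nu\gamma_5) = 0, \qquad (B.16)$$

$$\mathrm{Tr}(\gamma^\mu\gamma^\nu\gamma^\rho\gamma^\sigma) = 4\left(g^{\mu\nu}g^{\rho\sigma} - g^{\mu\rho}g^{\nu\sigma} + g^{\mu\sigma}g^{\nu\rho}\right), \qquad (B.17)$$

$$\mathrm{Tr}(\gamma_5\gamma^\mu\gamma^\nu\gamma^\rho\gamma^\sigma) = -4i\epsilon^{\mu\nu\rho\sigma} = 4i\epsilon_{\mu\nu\rho\sigma}, \qquad (B.18)$$

$$\mathrm{Tr}(\not{a}_1\not{a}_2\cdots\not{a}_{2n}) = \mathrm{Tr}(\not{a}_{2n}\cdots\not{a}_1), \qquad (B.19)$$

$$\mathrm{Tr}(\not{a}_1\not{a}_2\cdots\not{a}_{2n}) = a_1\cdot a_2\,\mathrm{Tr}(\not{a}_3\cdots\not{a}_{2n}) - a_1\cdot a_3\,\mathrm{Tr}(\not{a}_2\not{a}_4\cdots\not{a}_{2n}) + \cdots + a_1\cdot a_{2n}\,\mathrm{Tr}(\not{a}_2\cdots\not{a}_{2n-1}) = 4\sum \epsilon\,(a_{i_1}\cdot a_{j_1})\cdots(a_{i_n}\cdot a_{j_n}). \qquad (B.20)$$

$\epsilon$ is the signature of the permutation $i_1 j_1 \ldots i_n j_n$ and the sum runs over the $(2n)!/(2^n n!)$
different pairings satisfying $1 = i_1 < i_2 < \cdots < i_n$, $i_k < j_k$.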
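
The pairing formula (B.20) translates directly into a short recursive routine. The sketch below enumerates the pairings with their signs, evaluates the trace of a product of slashed four-vectors, and compares the four-vector case with Eq. (B.17) contracted with the vectors. The function names and the sample vectors are assumptions made for this example.

```python
def minkowski_dot(a, b):
    """Inner product with metric (1, -1, -1, -1)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]

def pairings(indices):
    """Yield (sign, pairs) for all pairings with i_1 < i_2 < ... and i_k < j_k."""
    if not indices:
        yield 1, []
        return
    first, rest = indices[0], indices[1:]
    for pos, partner in enumerate(rest):
        remaining = rest[:pos] + rest[pos + 1:]
        sign = -1 if pos % 2 else 1  # moving the partner next to `first`
        for sub_sign, sub_pairs in pairings(remaining):
            yield sign * sub_sign, [(first, partner)] + sub_pairs

def trace_slashed(vectors):
    """Trace of a_1-slash ... a_m-slash via the pairing sum of Eq. (B.20)."""
    total = 0.0
    for sign, pairs in pairings(tuple(range(len(vectors)))):
        term = float(sign)
        for i, j in pairs:
            term *= minkowski_dot(vectors[i], vectors[j])
        total += term
    return 4.0 * total

# Sample four-vectors (arbitrary numbers chosen for the check).
a1, a2, a3, a4 = (1.0, 0.2, 0.3, 0.4), (2.0, -1.0, 0.5, 0.0), \
                 (0.5, 0.1, -0.7, 1.2), (1.5, 0.9, 0.0, -0.3)
d = minkowski_dot
from_b17 = 4.0 * (d(a1, a2) * d(a3, a4) - d(a1, a3) * d(a2, a4)
                  + d(a1, a4) * d(a2, a3))
n_pairings_six = sum(1 for _ in pairings(tuple(range(6))))
print(trace_slashed([a1, a2, a3, a4]), from_b17, n_pairings_six)
```

For an odd number of vectors no complete pairing exists, so the routine returns zero, in agreement with the vanishing of odd traces.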


B.3 Some Feynman rules 


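In the text we gave Feynman rules for several vertices. We present here additional rules for
vertices of gauge bosons:

[Diagram: three-boson vertex with lines $A_\mu(k_1)$, $W^+_\nu(k_2)$, $W^-_\lambda(k_3)$]

$$ie\left[(k_1 - k_2)_\lambda\, g_{\mu\nu} + (k_2 - k_3)_\mu\, g_{\nu\lambda} + (k_3 - k_1)_\nu\, g_{\lambda\mu}\right]$$

[Diagram: three-boson vertex with lines $Z_\mu(k_1)$, $W^+_\nu(k_2)$, $W^-_\lambda(k_3)$]

$$-ig\cos\theta_W\left[(k_1 - k_2)_\lambda\, g_{\mu\nu} + (k_2 - k_3)_\mu\, g_{\nu\lambda} + (k_3 - k_1)_\nu\, g_{\lambda\mu}\right]$$

[Diagrams: four-boson vertices $WWWW$, $AAWW$, and $ZZWW$, with vertex factors proportional to $g^2 H$, $e^2 H$, and $g^2\cos^2\theta_W H$, respectively]

with $H_{\mu\nu\lambda\rho} = 2g_{\mu\nu}g_{\lambda\rho} - g_{\mu\lambda}g_{\nu\rho} - g_{\mu\rho}g_{\nu\lambda}$. In graphs all momenta are taken to be
entering into the vertices.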


Appendix C 


Identities for quark bilinears 


There are four identities for the product of two quark bilinears: 


[Sy"y?y’ ysd] [qiyu y wrea;] = 48” [Sy ved] [Ginyved,, 


[By y?v’ ved] [dire wedi] = 4[5y ved] [aiv’ yea]. 


-d] ] 
[Svv y" ysd][đivuy wedi] = 4[5v vzd] [iy v=q;], 
-d] ] 


[5v y.y" y 


The derivation follows from the identity

$$\gamma^\mu\gamma^\alpha\gamma^\nu = g^{\mu\alpha}\gamma^\nu + g^{\alpha\nu}\gamma^\mu - g^{\mu\nu}\gamma^\alpha + i\epsilon^{\mu\alpha\nu\beta}\gamma_\beta\gamma_5$$

and can be found in A. Sirlin, Nucl. Phys. B 192, 93 (1981).
d\[divuy wes] = 48” [Sy* ved] [dinyea)]-


(C.1) 
(C.2) 
(C.3) 
(C.4) 


(C.5) 